DOCKET NO. 490051.402USPC 
EXPRESS MAIL NO. EV718247176US 



DESCRIPTION 

METHOD AND SYSTEM FOR PRODUCING A TRANSGENIC ORGANISM 

USING METHYLATION 

5 

TECHNICAL FIELD 

The present invention relates to a system, a 
kit and a composition for introducing a foreign nucleic 
10 acid into a cell. More specifically, the present 
invention relates to the production of a transgenic 
organism and a composition, a kit and a system therefor. 

BACKGROUND ART 

15 Transgenic organisms are an important 

technology, which is currently of note at present, due 
to its broad range of applications. However, methods for 
efficiently producing transgenic organisms are not yet 
well developed, and therefore, development of such methods 

20 for efficiently producing transgenic organisms are of 
note . 

Recently, production of transgenic 
biological organisms using transposons has been attempted. 

25 A transposon (or alternatively called " transposable 
element") is a nucleic acid molecule or sequence with 
repeated sequences in parallel. Transposase is an enzyme 
promoting integration of a different nucleic acid into 
a nucleic acid molecule. Usually, the sequence for 

30 transposase exists in the transposon. 

Transposons are found in a relatively broad 
range of biological organisms, and the phenomenon attained 
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thereby is believed to be ubiquitous. Transposition is 
achieved by cut-and-paste mechanism. Transposons have 
also been found in vertebrates (Radice, A.D. et al.^ 
1994. Mol. Gen. Genet. 244,606-612). A number of 
5 transposons have been isolated from a variety of organisms 
such as fish, amphibia, mammals and the like, including 
those belonging to the super families Tcl/mariner, 
hAT (hobo/Ac/Tam) and the like (Oosumi et al . , 1995. Nature 
378, 873; Ivies et al., 1995. Mol. Gen. Genet. 247, 

10 312-322; Koga et al. 1996. Nature 383, 30; Lam et al., 
1996. J. Mol. Biol. 257, 359-366, and Lam, W. L. et al., 
Proc. Natl. Acad. Sci. USA 93, 10870-10875). 
Transposases are known to catalyze or facilitate excision 
from the original location of a transposon and 

15 reintegration thereof (Plasterk, RHA., 1999, TIG 15: 
326-332; Plasterk RHA. , 1996 Curr. Top. Microbiol. Immunol. 
204, 125-143) . An autonomous element of transposons can 
express active transposases, which are a transacting 
factor, and thus have the property of the transposon per 

20 se and can also transpose. A non autonomous element may 
be a cis-acting element, and this may be called as inverted 
terminal repeat sequence. Some inverted repeat sequences 
may include one or more tandem repeat sequences. Such a 
sequence is included in a terminal inverted repeat 

25 sequence and may be used for transposition from another 
element in the presence of a complementary transposase. 

Such a system has been used to attempt introduction 
of a foreign gene into a variety of organisms. 

30 

In plants, transposable elements such as Ac/Ds, Spm 
superf amilies and the like have been routinely used 
(Osborne and Baker, 1995 Curr. Opin. Cell Biol. 
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1 , 406-413) . Recently^ attempts have been made in 
animals^ however, such elements have species specificity, 
and thus it is said that very few attempts have been 
successful. Use of P element from Drosophila 

5 melanogaster for genetic transformation of a cell of a 
non-Drosophila insect. Zebra fish, mammals and the like, 
has not been successful to date (Handler et al., 1993. 
Arch. Insect Biochem. Physiol. 22, 373-384; Gibbs et al. 
Mol. Mar. Biol. Biotech. 3, 317-326; and Rio et al., 1988 
10 J. Mol. Biol. 200, 411-415). The Tcl/mariner superfamily 
member has little species specificity and thus is of note 
today, and has been tried in mammals, such as a human. 
Minos and TcE have also been attempted for such 
applications. 

15 

Sleeping Beauty (SB) has been specified as 
having an activity for promoting transposition of a salmon 
type Tel like transposon using molecular lineage 
classification data. A transposase gene common sequence 

20 has been deduced from eight types of inactive elements 
of the salmon subfamily elements from fish, and engineered 
to eliminate the mutations responsible for inactivation. 
The transposase has been constructed to identify its 
functional domain, and tested individually with respect 

25 to the biochemical functions thereof and from the points 
of the full length transposase. The transposase binds to 
two binding sites in an inverted repeat sequence of the 
salmon element and has substrate specificity, and thus 
allows prevention from cross-transposition between fish 

30 element subfamily members in a closely related species . 
SB transposase enhances not only fish, but also 
significantly enhances chromosomal integration of 
transposons in murine and human cells. The requirement 
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for a specific motif in a transposase and specific 
sequences in a target transposon have established SB 
transposase as the' first active DNA transposon system for 
insertion mutagenesis, in addition to activity in fish 
5 and mammalian cells and the like. In an aspect of the 
present invention, the present invention is related to 
a nucleic acid fragment, which comprises at least a nucleic 
acid sequence located between two inverted repeat 
sequences, and the inverted repeat sequences can bind to 

10 SB protein, and the nucleic acid fragment can be integrated 
into the DNA of the cell. In an embodiment, the cells of 
the present invention may be obtained from, for example, 
invertebrate or vertebrate. Preferable embodiments of 
invertebrates include, but are not limited to, for example, 

15 shrimps, scallop, lobsters, clam oysters and the like. 
Preferable embodiments of vertebrates include fish, birds, 
and mammals, including mice, Ungulata^ sheep, pigs, 
humans and the like. Cellular DNA may be of a cellular 
genome or epichromosomal DNA, including episomes, 

20 plasmids and the like. 

A method for introducing DNA into a cell is known, 
and includes for example, DNA aggregation reagents (e.g. 
calcium phosphate, polyethylene glycol and the like) , 

25 lipid-containing reagents (e.g., liposome, 

multi-lamellar vesicle and the like) , and virus mediated 
methods, and the like. These methods have their own 
deficiencies. For example, DNA aggregation reagents and 
virus mediated methods, have deficiencies where the size 

30 of the DNA is limited, and the amount of nucleic acids 
to be introduced is limited. The promotion of integration 
of the transgene is not always good. 
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There is still a demand for a method for 
introducing DNA into a cell^ in particular, efficient 
integration of a nucleic acid fragment of a variety of 
sizes into the nucleic acid of a cell, specifically a 
5 method for promoting integration of DNA into the genome 
of a cell. 

Z. Ivies et al. (Cell, 91, 501-510 (1997)) 
reported the expression of a transposon in a cultured cell, 

10 in which no effect of the transposon system had been 
confirmed in an adult mammalian individual or an organ 
thereof or the like. For animal cells, a mariner 
transposon has been isolated from Drosophila mauritiana, 
which is a subspecies of fruit fly. This mariner 

15 transposon was used to construct a vector. 

For example, it has been attempted to incorporate 
a P-element transposon of Drosophila melanogaster into 
chromosomal DNA of various heterologous organisms. 

20 However, the function of the P-element transposon was not 
maintained due to species specificity. In experiments 
using flies, such as Muscidae, Sphaeroceridae, Phoridae, 
or the like, the transposition activity of the P-element 
was not maintained (Handler et al.; Arch. Insect Biochem. 

25 Physiol. , 22c 373-384 (1993)). In the case of a transgenic 
zebra fish having an incorporated P-element and reporter 
gene, genetic expression was not stably obtained (Gibbs 
et al.; Mol . Mar. Biol. Biotech., 3:317-326 (1994)). It 
is known that when Tcl/mariner transposons, which are the 

30 most studied eukaryotic transposons, are used in 
heterologous organisms, transposition is likely to occur 
since the species specificity of these transposons is 
relatively low (Z. Ivies etal.;Cell, 91:501-510 (1997)). 
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An example of a transposon system comprising a transposon, 
which is reconstructed from such a Tcl/mariner-like 
transposon, and a transposase, is the above-described 
Sleeping Beauty (SB) transposon system comprising an SB 
5 transposon and an SB transposase. The following examples 
have been reported: the SB transposon was introduced into 
human HeLa cells and mouse LMTK cells (Z. Ivies et al.; 
Cell, 91:501-510 ( 1997 )); the SB transposon was introduced 
into mouse embryonic stem (ES) cells (G. Luc et al. ; Proc. 

10 Natl. Acad. Sci. USA, 95 : 107 69-10773 (1998) ) ; the activity 
of a Caenorhabditis elegans-derived Tel transposon, which 
was introduced into human cultured cells, was observed 
(G. Schouten et al.; Nucleic Acids Res., 26:3013-3017 
(1998)). However, for example, in the case of the 

15 above-described example in which the SB transposon was 
introduced into the mouse ES cells, the frequency of 
transposition of the transposon was 3.5x10"^ per cell, 
having the introduced exogenous gene at the maximum, which 
is considerably low. In this case, a large amount of cells 

20 had to be used in order to obtain a desired cell. The 
example where the SB transposon was introduced into human 
HeLa cells cannot be applied to animal individuals. For 
introduction of transposons into mammals, mice having a 
transposed transposon have been reported, in which the 

25 SB transposon and an SB transposase gene were introduced 
into the genome of somatic cells via the blood (S.R. Yant 
et al. ; Nature Genetics, 25:35-41 (2000)). In this case, 
however, the transposition frequency of the transposon 
was as low as about 5 to 6% of the liver cells having the 

30 introduced gene. This method has such a poor efficiency 
for gene introduction that a transgenic animal line could 
not be obtained. In the case of conventional techniques, 
it is difficult to randomly introduce mutations into a 
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number of genes of an animal individual in vivo. The 
expression frequency is also low. Therefore^ it was 
necessary to design a general method for mutagenesis of 
genes . 

5 

Accordingly, as mentioned above, when using such SB 
system to introduce a gene at the cell level, there are 
many cases where successful transformation has not been 
achieved, and there is a need for increasing 
10 transformation efficiency. 

As such, it is an object of the present invention 
to develop a transposon system to increase transformation 
efficiency and to efficiently achieve production of a 
15 transgenic biological organism. 

SUMMARY OF INVENTION 

The above-mentioned object has been achieved by, in 
20 part, the unexpected discovery by the present inventors 
as a result of study and efforts, that methylation of at 
least a portion of a nucleic acid sequence to be introduced 
increases transformation efficiency, in view of the 
above-mentioned background. 

25 

The present invention is related to the efficient 
introduction of a foreign gene into a cell by means of 
a transposon. In particular, the present invention is 
related to technology for efficiently producing a 
30 transgenic biological organism by methylating a sequence 
comprising a transposon, thereby surprisingly improving 
the transposition activity of the transposon. The 
methylation has been maintained after the integration into 
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the genome^ thereby allowing one to use it to identify 
the actual integration of the gene into the genome. The 
use of the present invention allows extremely efficient 
transformation of a gene^ more than that which can be 
5 achieved with conventional methods for producing a 
transgenic biological organism using a conventional 
transposon . 

Accordingly, the present invention provides 
10 the following: 

1- An isolated nucleic acid molecule having a nucleic 
acid sequence encoding a transposon, wherein the nucleic 
acid sequence has a methylation at at least one nucleotide . 

15 

2. An isolated nucleic acid molecule according to Item 
1, further comprising a nucleic acid molecule encoding 
a desired gene. 

20 3. An isolated nucleic acid molecule according to Item 
1, wherein said methylation is present at least C in a 
CG sequence. 

4. An isolated nucleic acid molecule according to Item 
25 1, wherein said transposon is of a DNA-type. 

5. An isolated nucleic acid molecule according to Item 
1, wherein said transposon belongs to Tcl/mariner types. 

30 6. An isolated nucleic acid molecule according to Item 
1, wherein said transposon comprises Sleeping Beauty, 



An isolated nucleic acid molecule according to Item 
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2, wherein said desired gene is operably linked to said 
transposon, or is capable of being operably linked to said 
transposon when intracellularly introduced. 

5 8. An isolated nucleic acid molecule according to Item 
1 for use in introducing a foreign gene into a host. 

9. An isolated nucleic acid molecule according to Item 
8, wherein said host comprises a eukaryote. 

10 

10. An isolated nucleic acid molecule according to Item 
8, wherein said host comprises a mammal. 

11. An isolated nucleic acid molecule according to Item 
15 8, wherein said host comprises a rodent. 

12. An isolated nucleic acid molecule according to Item 
1, wherein a transposase functions at a location on a 
genome to which said nucleic acid molecule is inserted. 

20 

13. A gene cassette having a nucleic acid sequence 
encoding a transposon, wherein said nucleic acid sequence 
has a methylation at at least one nucleotide. 

25 14- A vector having a nucleic acid sequence encoding a 
transposon and a nucleic acid sequence encoding a desired 
gene, wherein said nucleic acid sequence has a methylation 
at at least one nucleotide - 

30 15. A vector according to Item 14, wherein said 
methylation is present at least C in a CG sequence. 

16. A vector according to Item 14, wherein said 
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transposon is of a DNA-type. 

17. A vector according to Item 14, wherein said 
transposon belongs to Tcl/mariner types. 

5 

18. A vector according to Item 14, wherein said 
transposon comprises Sleeping Beauty. 

19. A vector according to Item 14, wherein said desired 
10 gene is operably linked to said transposon, or is capable 

of being operably linked to said transposon when 
intracellularly introduced. 

20. A vector according to Item 14 for use in introducing 
15 a foreign gene into a host. 

21. A vector according to Item 20, wherein said host 
comprises a eukaryote. 

20 22. A vector according to Item 20, wherein said host 
comprises a mammal. 

23. A vector according to Item 20, wherein said host 
comprises a rodent . 

25 

24 . A vector according to Item 14, wherein a transposase 
functions at a location on a genome to which said nucleic 
acid molecule is inserted. 

30 25. A composition for rendering a transposase to act on 
a foreign nucleic acid molecule to be inserted in a genome, 
wherein said composition comprises a nucleic acid sequence 
encoding a transposon and a nucleic acid sequence encoding 
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a desired gene, wherein said nucleic acid sequence has 
a methylation at at least one nucleotide. 

26. A cell comprising a nucleic acid sequence encoding 
5 a transposon and a nucleic acid sequence encoding a desired 

gene, wherein said nucleic acid sequence has a methylation 
at at least one nucleotide. 

27. A cell according to Item 26, wherein said 
10 methylation is present at least C in a CG sequence. 

28 . A cell according to Item 26, wherein said transposon 
is of a DNA-type. 

15 29. A cell according to Item 26, wherein said transposon 
belongs to Tcl/mariner types. 

30. A cell according to Item 26, wherein said transposon 
comprises Sleeping Beauty. 

20 

31. A cell according to Item 26, wherein said desired 
gene is operably linked to said transposon. 

32. A cell according to Item 26 for use in introducing 
25 a foreign gene into a host. 

33. A cell according to Item 26, wherein said host 
comprises a eukaryote. 

30 34. A cell according to Item 26, wherein said host 
comprises a mammal. 

35. A cell according to Item 26, wherein said host 
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comprises a rodent. 

36 • A tissue comprising a nucleic acid sequence encoding 
a transposon and a nucleic acid sequence encoding a desired 
5 gene, wherein said nucleic acid sequence has a methylation 
at at least one nucleotide. 

37. A tissue according to Item 36, wherein said 
methylation is present at least C in a CG sequence. 

10 

38. A tissue according to Item 36, wherein said 
transposon is of a DNA-type. 

39. A tissue according to Item 36, wherein said 
15 transposon belongs to Tcl/mariner types. . 

40. A tissue according to Item 36, wherein said 
transposon comprises Sleeping Beauty. 

20 41. A tissue according to Item 36, wherein said desired 
gene is operably linked to said transposon. 

42. A tissue according to Item 36 for use in introducing 
a foreign gene into a host. 

25 

43. A tissue according to Item 42, wherein said tissue 
comprises a eukaryotic tissue. 

44. A tissue according to Item 42, wherein said tissue 
30 comprises a mammalian tissue. 

45. A tissue according to Item 42, wherein said tissue 
comprises a rodent tissue. 
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46. A biological organism comprising a nucleic acid 
sequence encoding a transposon and a nucleic acid sequence 
encoding a desired gene, wherein said nucleic acid 
5 sequence has a methylation at at least one nucleotide. 

47 . A biological organism according to Item 46, wherein 
said methylation is present at least C in a CG sequence. 

10 48. A biological organism according to Item 46, wherein 
said transposon is of a DNA-type. 

49. A biological organism according to Item 4 6, wherein 
said transposon belongs to Tcl/mariner types. 

15 

50. A biological organism according to Item 4 6, 
wherein said transposon comprises Sleeping Beauty, 

51. A biological organism according to Item 4 6, wherein 
20 said desired gene is operably linked to said transposon. 

52. A biological organism according to Item 46, which 
comprises a eukaryote. 

25 53. A biological organism according to Item 46, which 
comprises a mammal. 

54. A biological organism according to Item 46, which 
comprises a rodent. 

30 

55. A biological organism according to Item 4 6, wherein 
said desired gene is not derived from said biological 
organism. 
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56. A method for producing a transgenic biological 
organism, comprising the steps of: 

A) providing an isolated nucleic acid molecule 
5 having a nucleic acid sequence encoding a transposon; 

B) transforming a germ cell of a desired biological 
organism with said nucleic acid molecule; 

C) selecting an individual in which the germ cell 
has a methylation in a nucleic acid sequence encoding said 

10 transposon; 

D) regenerating a biological organism using the 
transformed germ cell. 

57. A method according to Item 56, wherein said 
15 biological organism comprises a eukaryote. 

58- A method according to Item 56, wherein said 
biological organism comprises a mammal. 

20 59. A method for producing a transgenic biological 
' organism comprising the steps of: 

A) providing an isolated nucleic acid molecule 
having a nucleic acid sequence encoding a transposon, 
wherein the nucleic acid sequence has a methylation at 

25 at least one nucleotide; 

B) transforming a germ cell of a desired biological 
organism with said nucleic acid molecule; 

C) regenerating a biological organism using the 
transformed germ cell. 

30 

60. A method according to Item 59, wherein said 
biological organism comprises a eukaryote. 
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61. A method according to Item 59, wherein said 
biological organism comprises a mammal, 

62. A method according to Item 59, wherein said 
5 biological organism comprises a rodent. 

63. A kit for producing a transgenic biological organism, 
comprising the steps of: 

A) an isolated nucleic acid molecule having a 
10 nucleic acid sequence encoding a transposon, wherein the 

nucleic acid sequence has a methylation at at least one 
nucleotide; and 

B) a transposase. 

15 64. A kit according to Item 63, further comprising 
instructions indicating a method of use of said nucleic 
acid molecule and transposase. 

65. Use of an isolated nucleic acid molecule having a 
20 nucleic acid sequence encoding a transposon, wherein the 
nucleic acid sequence has a methylation at at least one 
nucleotide for producing a transgenic biological 
organism. 

25 66. A nucleic acid fragment comprising a nucleic acid 
sequence located between at least two inverted repeat 
sequences, wherein the inverted repeat sequences have the 
capability of binding to a transposase, wherein the 
nucleic acid fragment is capable of being incorporated 

30 into a DNA in a cell, wherein at least one nucleotide 
thereof is methylated. 

67. A nucleic acid fragment according to Item 66, 



16 KJ007 



wherein the nucleic acid sequence comprises at least a 
portion of a foreign gene. 

68. A nucleic acid fragment according to Item 66, 
5 wherein said nucleic acid sequence comprises at least one 

expression controlling region. 

69. A nucleic acid fragment according to Item 68, 
wherein the expression controlling region is selected from 

10 the group consisting of a promoter, enhancer and silencer. 

70. A nucleic acid fragment according to Item 66, 
further comprising at least a portion of a foreign gene, 
wherein the nucleic acid sequence is operably linked to 

15 a sequence encoding at least a portion of the foreign gene. 

71. A nucleic acid fragment according to Item 66, 
wherein said cell is derived from an animal. 

20 72. A nucleic acid fragment according to Item 71, 
wherein said cell is obtained from a vertebrate. 

73. A nucleic acid fragment according to Item 72, 
wherein the vertebrate is a mammal. 

25 

74. A nucleic acid fragment according to Item 73, 
wherein the mammal is a primate or a rodent. 

75. A nucleic acid fragment according to Item 66, the 
30 DNA of the cell is selected from the group consisting of 

a cellular genome, episome and plasmid. 



76. A nucleic acid fragment according to Item 66, 
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wherein said at least one inverted repeat sequence 
comprises the sequence set forth in SEQ ID NO: 20 or 21, 
or a portion thereof. 

5 77. A nucleic acid fragment according to Item 66, 
wherein said transposase is SB protein. 

78. A nucleic acid fragment according to Item 77, 
wherein the transposase has at least 80 % amino acid 

10 homology to the sequence set forth in SEQ ID NO: 3. 

79. A nucleic acid fragment according to Item 66, 
wherein said at least one inverted repeat sequence 
comprises at least one tandem repeat sequence, and the 

15 tandem repeat sequence comprises a nucleotide sequence 
set forth in SEQ ID NO: 26 or that having at least 80% 
homology to the sequence set forth in SEQ ID NO: 26. 

80. A nucleic acid fragment according to Item 66, 
20 wherein said at least one inverted repeat sequence 

comprises at least one tandem repeat sequence, wherein 
the tandem repeat sequence is selected from the group 
consisting of nucleic acid sequences set forth in SEQ ID 
NOs: 22-25. 

25 

81. A nucleic acid introduction system for introducing 
into a DNA of a cell another DNA, the system comprising: 

A) a nucleic acid fragment comprising a nucleic acid 
sequence located between at least two inverted repeat 
30 sequences, wherein the inverted repeat sequences have the 
capability of binding to a transposase, wherein the 
nucleic acid fragment is capable of being incorporated 
into a DNA in a cell, wherein at least one nucleotide 
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thereof is methylated; and 

B) a transposase or a nucleic acid encoding a 
transposase . 

5 82 . A nucleic acid introduction system according to Item 
81, therein the transposase is SB protein. 

83 . A nucleic acid introduction system according to Item 
81, wherein said transposase has the amino acid sequence 
10 set forth in SEQ ID NO: 3 or a variant thereof, or the 
nucleic acid sequence encoding the transposase has the 
nucleic acid sequence set forth in SEQ ID NO: 2 or a variant 
thereof. 

15 84 . A nucleic acid introduction system according to Item 
81, wherein said nucleic acid encoding the transposase 
is incorporated into the cellular genome . 

85 . A nucleic acid introduction system according to Item 
20 81, further comprising a plasmid or a virus vector, wherein 
said plasmid or virus vector comprises the nucleic acid 
fragment as a part thereof. 

8 6. A nucleic acid introduction system according to Item 
25 81, wherein nucleic acid fragment comprises at least a 
portion of a sequence encoding a foreign gene. 

87 . A nucleic acid introduction system according to Item 
81, wherein the nucleic acid fragment is introduced into 
30 the cell by means of a method selected from the group 
consisting of particle bombardment; electroporation; 
microinjection; use of a gene introduction reagent; and 
a use of virus vector. 
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88. A method for producing a transgenic biological 
organism, comprising the steps of: 

introducing a nucleic acid fragment and transposase 
5 into a pluripotent cell, wherein the nucleic acid fragment 
comprises a nucleic acid sequence located between at least 
two inverted repeat sequences, wherein the inverted repeat 
sequence have the capability of binding to a transposase, 
wherein the nucleic acid fragment is capable of being 
10 incorporated into a DNA in a cell, wherein at least one 
nucleotide thereof is methylated; and 

growing the cell into a living body. 

89. A method according to Item 88, wherein said 
15 pluripotent cell is selected from the group consisting 

of an oocyte, an embryo, an egg and a stem cell. 

90. A method according to Item 88, wherein said 
biological organism is a rodent or a primate. 

20 

91. A method according to Item 8 9, wherein said 
biological organism is a mouse or a rat. 

92 . A method for introducing a nucleic acid into a DNA 

25 of a cell, comprising the steps of: 

introducing a nucleic acid fragment into a cell, 
wherein the nucleic acid fragment comprises a nucleic acid 
sequence located between at least two inverted repeat 
sequences, wherein the inverted repeat sequences have the 

30 capability of binding to a transposase, wherein the 
nucleic acid fragment is capable of being incorporated 
into a DNA in a cell, wherein at least one nucleotide 
thereof is methylated. 
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93. A method according to Item 92, further comprising 
the step of introducing a transposase into the cell. 

5 94- A method according to Item 92, wherein the 
transposase has at least 80 % homology to the sequence 
set forth in SEQ ID NO: 3. 

95. A method according to Item 92, wherein said cell 
10 comprises a nucleic acid encoding the transposase . 

96. A method according to Item 95, wherein the nucleic 
acid encoding the transposase is incorporated into the 
cellular genome . 

15 

97. A method according to Item 95, wherein the 
transposase is stably expressed in the cell. 

98. A method according to Item 95, wherein the 
20 transposase is operably linked such that it is under the 

control of an inducible promoter. 

99- A method according to Item 92, wherein the nucleic 
acid sequence encodes a protein. 

25 

100. A method according to Item 92, wherein the nucleic 
acid sequence encodes a marker protein. 

101- A method for tranposing a nucleic acid sequence in 
30 a cell, comprising the steps of: 

introducing a transposase into a cell comprising a 
nucleic acid fragment, wherein the nucleic acid fragment 
comprises a nucleic acid sequence located between at least 



21 KJ007 



two inverted repeat sequences, wherein the inverted repeat 
sequences have the capability of binding to a transposase, 
wherein the nucleic acid fragment is capable of being 
incorporated into a DNA in a cell, wherein at least one 
5 nucleotide thereof is methylated; 

wherein the transposase transposes the nucleic acid 
sequence from a first location in a DNA of the cell to 
a second location of the DNA. 

10 102. A method according to Item 101, wherein the DNA of 
the cell is a genomic DNA. 

103. A method according to Item 101, wherein the first 
location is an extrachromosomal DNA. 

15 

104. A method according to Item 101, wherein the second 
location is an extrachromosomal DNA- 

105. A method according to Item 101, wherein the 
20 transposase introduces a nucleic acid into the cell. 

106. A method for identifying a gene in a cell, comprising 
the steps of: 

introducing into a cell, a nucleic acid fragment 
25 comprising a nucleic acid sequence located between at 
least two inverted repeat sequences, wherein the inverted 
repeat sequences have the capability of binding to a 
transposase, wherein the nucleic acid fragment is capable 
of being incorporated into a DNA in a cell, wherein at 
30 least one nucleotide thereof is methylated, and a 
transposase; 

digesting the DNA in the cell with a restriction 
endonuclease capable of digesting the nucleic acid 
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sequence; 

identifying the inverted repeat sequencer- 
determining a sequence of a nucleic acid having a 

similar sequence to the inverted repeat sequence; and 
5 comparing the sequence with sequence information in 

a sequence information database. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 Figure 1 depicts that CpG methylation is 

involved in a high frequency of transposition. 

Figure lA depicts the introduction of a 
single copy of the SB transposon into Sptlc2 locus by 

15 insertion-type homologous recombination and the 
detection of transposon excision by PGR. White square: 
exon, solid triangle: loxP site, gray and white arrows: 
nested PGR primers, less dense line: backbone sequence 
of the plasmid, GAP: GAP promoter, IR/DR-R and IR/DR-L: 

20 right IR/DR and left IR/DR, respectively, Xb: Xbal, B: 
BamHI, K: Kpnl . 

Figure IB depicts excision of SB transposon 
in Sptlc2 locus of an ES cell . ES 115 clone (shown in panel 

25 A) having SB transposon inserted therein by targeting 
Sptlc2 locus, was transfected with sSBlO expressing the 
wild type of the SB transposase (transposase +) or 
pSBlO-DDE expressing the inactive SB transposase with 
deletion of DDE box (transposase -) (Ivies, Z-, P. B. 

30 Hackett, R. H. Plasterk, and Z. Izsvak. 1997 ; Gell 
91:501-510) in a serial dilution. As shown in Panel A, 
the resultant was screened with nested PGR. Four 
independent transf ections for each of the dilution factors 
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of pSBlO were conducted, and eight independent PGR with 
1 (ig of genomic DNA per reaction to conduct screening of 
8jig of genomic DNA per transf ection . Typical PGR results 
with the maximum amount of pSBlO {2^ig) are shown in the 
5 left panel. The right panel shows the results of the 
average number of positive PGR for each transfection and 
the RT-PGR analysis at the expression level, Actb: 
beta-actin. 

Figure IC shows Southern blot analysis of 
methylation state in the transposon sequences of male 
mouse germ line cell and ES cell. EGFP was used as a probe. 
Genomic DNA derived from the germ line cell has not been 
digested by Hpall. This shows that the site has mostly 
been methylated. On the other hand, the genomic DNA from 
the ES115 clone has been substantially completely digested. 
The presence of minor band (shown as asterisk) shows that 
a small fraction of Hpall site has been methylated in an 
ES cell. Solid circle: Hpall (H) or Mspl (M) site; X: Xhol. 
site; 0.5 kb bands derived from Hpall-Mspl fragment in 
KM, KXM and KXH lanes are shown with arrow. 

Figure ID shows a result of an assay of temporary 
transposon excision . 
25 a) Experimental procedure is shown. Transposon DNA 
(pTransCX-EGFP: neo, Horie et al., Proc. Natl. Acad. Sci. 
USA 2001) has been methylated in advance with SssI CpG 
methylase. Murine erythroleukemia cell (MEL cell) was 
introduced with transposon DNA and Sleeping Beauty (SB) 
30 transposase. Total DNA was extracted from the cell, and 
PGR was conducted with the plasmid vector using primers, 
and 358 bp PGR products detected, which are amplified when 
the excision reaction is caused. 
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b) The result of the PGR is shown. A cell in which 
methylated transposon has been introduced with a 
transposition enzyme, showed excision reactions more 
often than a cell with unmethylated transposons. 
5 Abbreviation: CAG: GAG promoter, EGFP: green fluorescent 
protein, pA: poly addition signal, L: left IR/DR, R: right 
IR/DR, M: methylation, N: non-methylation, NC: negative 
control (MEL cell, genomic DNA) . 

Figure 2 shows a figure of establishment of 
a cell having a methylated or unmethylated transposon in 
the same genetic locus of a murine genome. Figure 2A 
depicts a flow chart of an exemplary experiment. 
Figure 2B shows a result of Southern blotting analysis. 
Figure 2C-D show examples of exemplary results for 
confirming maintenance of methylation. Figure 2D shows 
production of cells having methylated SB transposon or 
non-methylated SB transposon in the predetermined locus 
of the genome. 

Figure 2E shows a fluorescent activated cell 
sorting (FAGS) for EGFP expression in a target clone. 
Gray region: wild-type cell. Light line: target clone. 

25 Figure 3A is an example showing the effects 

of DNA methylation on the reaction of transposon excision 
in the murine genome. 

Figure 3B-C depicts the details of frequent 
30 excision of methylated SB transposon in a predetermined 
locus of the genome- A portion thereof overlaps with 
Figure 3A. 
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One microgram or 10 nanograms of clones shown 
in Figure 2 have been used as a template for nested PGR. 
For each clone, 10 rounds of PGR have been performed. (B) 
Direction D; (G) Direction I. NG: genomic DNA which has 
5 not been transfected as a negative control, M: 100 bp 
ladder. Right panel shows methylation state of the 
transposon region and the parent clone. 

Figure 4 depicts an example showing the 
10 effects of insertion of a transposon into the genome. A 
depicts an exemplary plasmid construct used herein. B 
shows methylation (left) and non-methylation (right) . 

Figure 4B depicts effects of GpG methylation 
15 on the SB transposition. Average values of G418 resistant 
colony per culture dish against three independent 
transf ections are shown on the right panel. Error bar 
shows standard deviation. 

20 Figure 4C depicts the structure of a gene 

trapped and transposon insertion site. The insertion 
site is shown with black arrows. Glones M1-M4 are derived 
from transfection with methylated transposon, and Glones 
N1-N6 are derived from non-methylated transposon. The 

25 number of chromosomes, ensemble gene designator, and the 
gene nomenclature are also shown. Except for Clone M2, 
in which two insertion sites have been characterized 
(called M2S and M2L) , one insertion site has been 
identified for each clone. Correct splicing in some 

30 clones (M2S, M2L, M4 and Nl) have been confirmed with 
RT-PGR using primers to the upstream exon (white arrow) 
and transposon. Solid square: exon. Scale bar for 5kb 
is shown on the right hand side. 
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Figure 4D shows the effect of methylation on 
Tc transposition. Construct for the trapping vector is 
shown in the upper panel. TIR: terminal inverted repeat. 
5 Three independent transf ections are shown in the lower 
panel with respect to the average number of G418 resistant 
colonies per dish. 

Figure 5 shows an alignment of typical 
10 transposon sequences. Multiple alignment with X01005 
(1-1610), Z29098 (15-1787), Z29102 (15-1787), U11641 
(188-1451), U11652 (146-1442) and L48685 (1-1455) are 
shown (the numerals in parentheses show the range of base 
sequence in the sequence of each Accession number) 

15 

Figure 6 shows invariant affinity of SB 
transposon DNA binding domain against naked methylated 
IR/DR. 

20 Figure 6A shows an exemplary drawing of CpG 

sites in the IR/DR and transposase binding sites. Solid 
circle: CpG sites. 

Figures 6B-E shows an electrophoretic 
25 mobility shift assay (EMSA) in the recombinant SB 
transposase peptide (N123) . 

Figures 6B and 6C show results obtained in the 
external binding site of IR/DR-L 34 bp, and Figures 6D 
30 and 6E show the results obtained by IR/DR-L fragment of 
300 bp. 

As shown in Figure 6B, a concentration of N123 
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peptide sufficient for enhancing non-methylated or 
methylated external binding sites (1600 to 100 dilution 
of the purified peptide) mixed therewith to form a 
nucleoprotein complex . 

5 

As shown in Figure 6C, non-methylated 

external binding site has been labeled for use as a probe, 
and non-methylated external binding site or methylated 
external binding site have been used as a competitive 
10 substance in an increased concentration (10-50 folds molar 
excessive probes) . 

As shown in Figure 6D^ non-methylated IR/DR-L 
fragment or methylated IR/DR-L fragment have been mixed 
15 with an increased concentration of N123 peptides (5100-160 
fold dilution of the purified peptides) for formation of 
nucleotprotein complex . 

As shown in Figure 6E, non-methylated IR/DR-L 
20 fragment was labeled for use as a probe,, and non-methylated 
or methylated IR/DR-L fragments have been used as a 
competitive substance in an increased concentration 
(500-8000 fold molar excess of probes). Unmet: 
non-methylated. Met: methylated, F: free probe, C: complex. 
25 Complexes 1 (CI) and 2 (C2) indicate one and two 
molecule (s) of N123 peptides per IR/DR, respectively 
(Ivies, Z., P. B. Hackett, R. H. Plasterk, and Z . Izsvak. 
1997; Cell 91: 501-510) . 

30 Figure 7 shows ChIP assay of methylated 

transposon or non-methylated transposon in a 
predetermined locus . 
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Figure 7A shows SB transposon region^ which 
was analyzed in the ChIP assay. PGR amplification region 
is shown as light line under each component. Solid 
triangle: lox 511 site. 

5 

Figures 7B and 7C show PGR analysis of 
precipitated DNA. As shown in Figures 3 and 4, Clones 5M3 
and 5U3 (B) and 6M11 and 6U1 (C) are derived from parent 
clones RL5 and RL6, respectively. Input and precipitated 

10 DNA have been analyzed in a five-fold stepwise dilution. 
AcH3: anti-acetylated H3, MeH3K9; anti-trimethylated 
H3K9, no Ab: control having no antibodies. Amylase 2.1 
and beta-small globin have been used as a typical controls 
against heterochrome regions and euchromatin regions. 

15 Intensity of bands of amylase 2.1 and beta-small globin 
are the same in the anti-acetyl H3 fraction, whereas the 
higher amplification efficiency than amylase 2 . 1 sequence 
in the input DNA, indicates that beta-small globin is 
enriched compared with the amylase 2.1 sequence in this 

20 fraction. Minor bands found in the control lanes without 
antibodies are derived from non-specific binding of the 
genomic DNA to the protein. Agarose beads are used for 
preparation. Determination was conducted twice. 

Representative results are shown. 

25 

In the Figures, ""GFP" refers to the GFP gene. 
As used herein, the transposon used in the Sleeping Beauty 
(SB) transposon system are SB transposon, and the 
transposase is the SB transposase. In the figures, ^^SB'' 
30 indicates the Sleeping Beauty transposase gene. 



DESGRIPTION OF SEQUENGE LISTING 
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SEQ ID NO: 1: DNA sequence of the transposon SB (Sleeping 
Beauty) (GENBANK Accession No. : L48685) 
SEQ ID NO: 2: Nucleic acid sequence of SB transposase 
SEQ ID NO: 3: Polypeptide sequence of SB transposase 
5 polypeptide 

SEQ ID NO: 4: Nucleic acid sequence of left outer repeat 
sequence 

SEQ ID NO: 5: Nucleic acid sequence of left internal repeat 
sequence 

10 SEQ ID NO: 6: Nucleic acid sequence of TgTP-lU 
SEQ ID NO: 7: Nucleic acid sequence of TgTP-2L 
SEQ ID NO: 8: Nucleic acid sequence of TgTP-2U 
SEQ ID NO: 9: Nucleic acid sequence of TgTP-3L 
SEQ ID NO: 10: Nucleic acid sequence of transposable 

15 element Tclderived from Caenorhabditis elegans (GENBANK 
Accession X01005) 

SEQ ID NO: 11: Amino acid sequence of transposable element 
derived from Caenorhabditis elegans (GENBANK Accession 
X01005) 

20 SEQ ID NO: 12: Nucleic acid sequence of Minos-2 derived 
from Drosophila hydei (GENBANK Accession Z29098) 
SEQ ID NO: 13: Drosophila hydei iyiinos-2 amino acid sequence 
(GENBANK Accession No.: Z29098) 

SEQ ID NO: 14: Drosophila hydei Minos-3 nucleic acid 
25 sequence (GENBANK Accession No.: Z29102) 

SEQ ID NO: 15: Drosophila hydei Minos-3 amino acid sequence 
(GENBANK Accession No.: Z29102) 

SEQ ID NO: 16: Haematobia irritans Hi2 mariner nucleic 
acid sequence (GENBANK Accession No. : U11641) 
30 SEQ ID NO: 17: Haematobia irritans Hi2 mariner amino acid 
sequence (GENBANK Accession No.:1641) 

SEQ ID NO: 18 : Chrysoperla plorabunda mariner nucleic acid 
sequence (GENBANK Accession No.: U11652) 
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SEQ ID NO: 19: Chrysoperla plorabunda mariner amino acid 
sequence (GENBANK Accession No. : U11652) 
SEQ ID NO: 20: IR/DR-R sequence is shown: 
aattccatcacaaagctctgacctcaatcctatagaaaggaggaatgagccaa 
5 aattcacccaacttattgtgggaagcttgtggaaggctactcgaaatgtttga 
cccaagttaaacaatttaaaggcaatgctaccaaatactaattgagtgtatgt 
taacttctgacccactgggaatgtgatgaaagaaataaaagctgaaatgaatc 
attctctctactattattctgatatttcacattcttaaaataaagtggtgatc 
ctaactgaccttaagacagggaatctttactcggattaaatgtcaggaattgt 
10 gaaaaagtgagtttaaatgtatttggctaaggtgtatgtaaacttccgacttc 
aactgta 

SEQ ID NO: 21: IR/DR-L sequence is shown: 

ccttgaaatacatccacaggtacacctccaattgactcaaatgatgtcaatta 

gtctatcagaagcttctaaagccatgacatcattttctggaattttccaagct 

15 gtttaaaggcacagtcaacttagtgtatgtaaacttctgacccactggaattg 
tgatacagtgaattataagtgaaataatctgtctgtaaacaattgttggaaaa 
atgacttgtgtcatgcacaaagtagatgtectaactgacttgccaaaactatt 
gtttgttaacaagaaatttgtggagtagttgaaaaacgagttttaatgactcc 
aacttaagtgtatgtaaacttccgacttcaactgta 

20 SEQ ID NO: 22: example of repetitive portion of an 
inverted repeat sequence: gttcaagtcg gaagtttaca 
tacacttag 

SEQ ID NO: 23: example of repetitive portion of an inverted 
repeat sequence: cagtgggtca gaagtttaca tacactaagg 

25 SEQ ID NO: 24 : example of repetitive portion of an inverted 
repeat sequence: cagtgggtca gaagttaaca tacactcaat t 
SEQ ID NO: 25: example of repetitive portion of an inverted 
repeat sequence: agttgaatcg gaagtttaca tacaccttag 
SEQ ID NO: 26: example of common portion of repetitive 

30 portion of an inverted repeat sequence: caktgrgtcr 
gaagtttaca tacacttaag 

SEQ ID NO: 27: example of left outer repeat 
SEQ ID NO: 28: example of left inner repeat 



31 



KJ007 



SEQ ID NO: 29: forward sequence used for PGR amplification 
from pCMV-SB 

SEQ ID NO: 30: reverse sequence used for PGR amplification 
from pCMV-SB 
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fluorescence labeled probe for excision 



production 

SEQ ID NO: 42: fluorescence labeled probe for excision 
production 

SEQ ID NO: 43: fluorescence labeled probe for neo fragment 
20 SEQ ID NO: 44: fluorescence labeled probe for neo fragment 
SEQ ID NO: 45: PGR primer for excision product 
SEQ ID NO: 46: PGR primer for excision product 
SEQ ID NO: 47: PGR primer for neo fragment 
SEQ ID NO: 48: PGR primer for neo fragment 
25 SEQ ID NO: 49: primer LGB2XL2 
SEQ ID NO: 50: primer PGK2 
SEQ ID NO: 51: primer LGB2XL1 
SEQ ID NO: 52: primer PGK4 

SEQ ID NO: 53: forward primer for amplification of SB 
30 transposase gene 

SEQ ID NO: 54: reverse primer for amplification of SB 
transposase gene 

SEQ ID NO: 55: forward primer for amplification of 
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beta-actin transposase gene 
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25 BEST MODE FOR CARRYING OUT THE INVENTION 

Hereinafter the present invention is described. 

It should be understood throughout the 
present specification that expression of a singular form 
30 includes the concept of their plurality unless otherwise 
mentioned. Specifically, articles for a singular form 
(e.g., ^^a'S ^^an", ^^the'% etc. in English; ^^ein'', ^Mer", 
^Mas", ^Mie", etc. and their inflections in German; ^^un". 
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^^une'\ ^^le", ^^la", etc. in French; ^^un", ^^una", ^^el", ^^la", 
etc. in Spanish, and articles, adjectives, etc. in other 
languages) include the concept of their plurality unless 
otherwise mentioned. It should be also understood that 
5 the terms as used herein have definitions typically used 
in the art unless otherwise mentioned. Thus, unless 
, otherwise defined, all scientific and technical terms have 
the same meanings as those generally used by those skilled 
in the art to which the present invention pertain. If 
10 there is contradiction, the present specification 
(including the definition) precedes. 

(Definition and Description of Terms) 
Hereinafter, the definition of terms 
15 specifically used herein are listed. 

Transposon as used herein refers to a nucleic acid 
molecule or nucleic acid sequence which is capable of 
moving (transposition) from one site to another on a 

20 chromosome. Typically, transposon is a DNA segment (DNA 
transposon) . DNA transposons (hereinafter simply 
referred to as ^^transposons'') are activated by a 
transposase and are then transposed. Transposons include, 
but are not limited to, for example, SB transposon (Acc. 

25 No. L48685; SEQ ID NO: 1), and those included in the 
sequences set forth in SEQ ID NOs: 10-19, and the like. 

As used herein, the term ^^transposase" refers to an 
enzyme having catalytic activity of recognizing a 
30 transposon and transposing the transposon. Transposases 
include, but are not limited to, for example, SB 
transposases (SEQ ID NOs: 2, 3) and those included in the 
sequences set forth in SEQ ID NOs: 10-19, and the like. 
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In the present invention, any transposon 
sequences and transposase genes used in a transgenic 
biological organism, regardless of being internal or 
5 external, preferably external transposon sequences and 
transposase genes, can be used. 

Transposons usually have a repeated sequence 
(hereinafter referred to as a ^^transposon sequence") on 

10 each end thereof. This repeated sequence is a transposase 
recognition site. Examples of such repeated sequences 
include, but are not limited to, for example, SEQ ID NO: 
22-26. The transposon sequence may include an imperfect 
repeated portion if the transposon can be transposed by 

15 action of a transposase. An insertion recognition site 
having a length specific to transposons within DNA into 
which a transposon is to be inserted is called a target 
sequence. For example, in the case of the Sleeping Beauty 
(SB) transposon system (Z. Ivies et al.; Cell, 91:501-510 

20 (1997)), the target sequence is TA. After a transposon 
is inserted, the sequence is TA-transposon-TA. 
Accordingly, as used herein, transposon sequence refers 
to a sequence recognized by a transposase, and 
transposable natural or artificial transposon. sequence 

25 in a desired organism. 

As target sequences of transposons, for example, TA, 
ATAT, TATATA, TACA, and the like are known. Regardless 
of the type of target sequences, the object which enhances 
30 efficiency of gene introduction by a transposon has been 
achieved by methylation of the transposon in the present 
invention. 
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Transposons include mainly those self-completing 
type, internally including enzymatic transposase which 
can catalyze transposition per se, and those 
self-incompleting type, lacking transposase activity. 
5 Biological organisms, each having a transposon sequence 
and a transposase gene are bred. When biological 
organisms having a transposon sequence and a transposase 
gene are obtained, or when biological organisms having 
a transposon sequence whose transposition is fixed, 

10 (without a transposase gene) are obtained, non-self 
completed types of transposons are used. When biological 
organisms including both transposons and transposases or 
those comprising signature and lacking a transposon and 
a transposase gene are obtained, both self completing type 

15 and non-self completing type transposons can be used. The 
self completing type transposon may be converted to a 
non-self completing type by excising a transposase, by 
using the Cre loxP system, which is described below, or 
breeding a biological organism having the loxP sequence 

20 at both sides of the transposase gene and nqn biological 
organisms having Cre. Further, transposons of 

host-dependent type and host-independent type both can 
be used. Generally, the host-independent type can be 
used. 

25 

In the present invention, transposon only 
and/or transposase only may be excised from the transposon 
system comprising a transposon and a transposase gene for 
use, or transposase may be ' inactivated to make the 
30 transposon the non-self completing type for use. A 
transposon system useful for practicing the present 
invention, includes any sequence which is transposable 
in a biological cell, and preferably a mariner superf amily 
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member may be used. Examples thereof include for example, 
transposon family such as Tel, SB, Minos, Txr, To; mariner 
transposon family including Caenorhabditis elegans, Mosl, 
Hyalophora cecropia and the like; Pogo transposon family 
5 such as Pogo, Tigger,"4 and the like (R.H. Plasterk et 
al.; Trends in genetics, 15: No. 8: 326-332(1999)). Most 
preferably, the SB (Sleeping Beauty) transposon may be 
used. Non-self completing transposon system may be 
obtained by removing or inactivating a transposase gene 
10 in a self completing type transposon. 

As used herein, ^^DNA type'' transposon refers 
to a transposon for use in transposing a DNA. Usual 
transposons are of DNA type. In typical embodiments, the 
15 present invention is practiced by using a DNA-type 
transposon. 

As used herein, the term ^^Tcl/mariner type 
transposon" refers to a transposon similar to Tcl/mariner . 
20 Transposons belonging to Tcl/mariner type include, but 
are not limited to: minos, SB, Tel, mariner and the like. 
This superfamily is believed to be derived from a 
vertebrate genome (Radice et al., 1994; Smit and Riggs, 
1996 Proc. Natl. Acad. Sci. USA 93, 1443-1448). 

25 

As used herein the term ^^SB" and ^^Sleeping 
Beauty'' refer to a transposon of the Tcl/mariner type, 
and which has transposon activity in a mammal or a 
mammalian cell (Ivies et al.. Cell 91:501-510, 1997) . As 
30 used herein the term SB may be used as SB polypeptide, 
SB transposon and the like. 
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transposon sequences may be a variety of DNA sequences 
(for example, marker genes, gene expression regulatory 
sequences, desired genes and the like) . A variety of 
transposon constructs combining a variety of elements as 
5 necessary, in addition to a transposon sequence, can be 
constructed. In the present invention, a target cell for 
introduction of a transposon construct or transposase gene, 
may be a cell having the potential to differentiate into 
an individual of a biological organism, preferably a 
10 non-human organism, and such cells include, for example, 
stem cells or fertilized cells. 

The transgenic biological organism of. the 
present invention includes founder (not only the first 

15 generation but also those lineages established based on 
the founder are of course encompassed by the present 
invention) having either or both a transposon construct 
and a transposase. Further, organs, tissues, eggs, 
sperms and fertilized eggs derived from the transgenic 

20 biological organism lineage of the present invention, 
established cell lines established from a lineage of the 
transgenic biological organism, cloned individuals 
produced from the transgenic biological organisms, are 
also encompassed by the present invention. Transposon 

25 constructs of the present invention may be constructed 
by combining a transposon sequence with a variety of other 
elements and introduced into a stem cell or fertilized 
egg or the like. 

30 (Development of transposon vectors having high gene 

disruption efficiency) 

Hereinafter, the transposon system 
development which the inventors have conducted is 
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described. Transposon system as used in the present 
invention usually consists of two elements of a transposon 
sequence, a DNA sequence transposing on the genome, and 
a transposase, an element catalyzing the transposition. 
5 If there is a gene in the place to be transposed, the 
function of the gene may be disrupted. However, most of 
the genes are composed of introns, and thus even if it 
is inserted into inside the gene, there is a possibility 
of elimination of the transposon sequence when splicing. 

10 In order to prevent this elimination, it is possible to 
construct a construct having a splice acceptor inside the 
structure. A transcription product of an internal gene 
is stopped in the transposon, and as a result, it is 
expected that the function of the internal gene is 

15 efficiently disrupted. In order to investigate this 
hypothesis, for three lineages of mice which have been 
inserted with transposons inside the gene, homozygous mice 
are obtained, and tail RNA are used to conduct RT-PCR using 
primers corresponding to exons upstream and downstream 

20 of insertion sites, to confirm elimination of bands 
corresponding to the normal gene product. Furthermore, 
it is possible to confirm the expression of the lacZ gene 
or the like for the existence of expected splicing. As 
such, verification of extremely high gene disruption 

25 efficiency of the produced vector can be confirmed. 
Moreover, when tested for determination of transposition 
of 200 or more sites for transposition distribution, it 
can be confirmed that about 80 % of the transpositions 
are in the identical chromosome and the remaining 20 % 

30 are transposed to other chromosomes. Accordingly, in 
order to introduce exhaustively mutation into the entire 
genome, it is desirable to produce a mouse having a 
transposon in a variety of chromosomes. Based on this 
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idea, to date, a nuinber of lineages of mice having 
transposons at different sites on the genome have been 
established. In fact, the present inventors have 
established more than 20 lineages. In the present 
5 procedures, it is important that the transposon insertion 
site per se, before transposition, have - no effects on 
phenotypes, and thus systems satisfying these conditions 
are identified. On the other hand, when transposition 
happens in an identical chromosome, the transposition 

10 appears to be concentrated in the distance of within 3Mb 
before the transposition. This suggests the possibility 
of introduction of a variety of mutations onto a specific 
genetic locus. In fact, in some lineages of mice, 
insertion of transposons into the Neurexin 3 gene is 

15 identified in four mice. From the Neurexin 3 gene, 
different promoters are used to produce alpha and beta 
types of two proteins, and the present inventors have also 
found that it is recognized that the four mice have 
identified two out of four species the alpha-type only, 

20 and the remaining two types of both alpha and beta types 
have been introduced at locations with mutations. Site 
specific mutagenesis of this type is believed to be a 
specific property of transposons, and is expected to have 
future applications . 

25 

In the present invention, transposons are 
methylated in vitro, and introduced to an ES cell together 
with a transposase expression vector, resulting in 
significant increase in the transposition efficiency into 
30 the genome. Further, in order to assay transposition 
efficiency in the genome, a methylated transposon or a 
transposon in the state of methylation, is inserted into 
a specific site of the genome. After methylation of the 
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transposon and the state of methylation has been confirmed 
to be maintained after insertion into the genome/ when 
a transposase is expressed in the cell, methylation 
introduction is greater than 100 times more efficient than 
5 transposition using a transposon having no methylation. 
It was also found that the transposon region has been 
heterochromatinated, which indicates the possibility of 
enhancing transposition frequency of modifying the state 
of the genome of the transppson region. 

10 

As used herein the term ''^methylation" refers 
to the addition of a methyl group to a nucleic acid molecule . 
If similar effects are found, the methyl group may be a 
similar group such as lower alkyl group. Methylation is 
15 catalyzed by a methylation enzyme (methylase) in vivo. 
As used herein methyl group is provided by a substrate 
via a methyl group transposition response. 

One example is that a methyl group is produced as 
20 5-methyl tetrahydrof olate via enzymatic reduction of 
5/ 10-methylene tetrahydrof olate, and transferred to a 
homocysteine via a type of cobamide enzyme to result in 
a methionine. The methionine is converted to S-adenosyl 
methionine via action of ATP, which then acts as methyl 
25 donor which is used for production of a variety of methyl 
products. Betaine, which is an oxidated type of choline, 
may also act as a methyl donor. Several tens of methyl 
group transferase specific to each compound have been 
elucidated, and as necessary, an appropriate enzyme may 
30 be selected by those skilled in the art, and may be used 
in the present invention. Alternatively, methylation may 
be added by chemical reaction in addition to the use of 
an enzyme. 
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As used herein^ the confirmation of methylation may 
be conducted by means of any conventional technology known 
in the art. Such a method includes, but is not limited 
5 to for example, confirmation of changes in physical 
behavior by methylation, bisulfite modified sequencing 
and the like (GitanRS, etal.. Genome Res. Jan; 12: 158-64. 
2002; Lilischkis R. , et al. Diagn. Mol. Pathol. 2000; 9: 
165-71. ) . 

10 

In a system such as those using transposon 
<- like SB, each inverted repeat typically comprises at least 
one direct repeat. The gene transposition system of the 
present invention includes, therefore two constructs: 
15 transposase and cloned non-self completing (i.e. non-self 
insertion) salmon type element or a transposon including 
transposon substrate DNA of an inverted repeat. When 
mixing together, these two constructs provide active 
transposon activity. 

20 

As used herein the term ^^inverted repeat 
sequence'' is a sequence acting in a transposon, and has 
often a structural feature of having an inverted repeat 
of 15-40 base pairs at both termini, such as those of SEQ 

25 ID NOs : 22-26. Typical sequences used herein are set 
forth in SEQ ID NOs: 20-21. These inverted repeat 
sequences are insertion sequences, and one of 
transposition elements. Each inverted repeat sequence 
preferably comprises at least one tandem repeat sequence 

30 (i.e. called IR/DR) . Transposon elements may be a linear 
nucleic acid fragment (for convenience, from the 5' 
terminus to the 3' terminus) which can be used as a linear 
fragment, or for example, circular in a plasmid. In a 
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preferable embodiment, two direct repeat sequences occur 
in each inverted repeat sequence. 

Preferable inverted repeat sequences herein 
5 include, but are not limited to, for example, the inverted 
repeat sequence of the Tcl/mariner type transposon and 
the SB transposon, and the like. Amongst these, those 
binding to SB include but are not limited to, the inverted 
repeat sequence of the SB transposon. 

10 

Preferable direct repeat sequences herein 
include, but are not limited to, for example, direct repeat 
sequences of Tcl/mariner type transposon and SB transposon 
and the like. Amongst these, those binding to SB include 
15 but are not limited to, direct repeat sequence of SB 
transposon. 

The direct repeat sequences are typically 
about 25 to about 35 base pairs in length, and preferably 

20 about 29 to about 31 base pairs in length. However, 
regardless of this, an inverted repeat may include the 
sole direct repeat. In these cases, it is not an actual 
repeat, but as described below in detail, these are 
nucleotide sequences having at least about 80 % identity 

25 to the consensus direct repeat sequence. The transposon 
element may be linear chain of nucleic acid fragments, 
which may be used as a linear fragment or a circular 
fragment, for example, a plasmid (conventionally, 
extending from the 5' terminus to the 3' terminus). 

30 

In a preferred embodiment of the transposon, there 
are two direct repeats in each inverted repeat sequence. 
The direct repeats (which number, in this embodiment, is 
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four) have similar nucleotide sequences, as described in 
more detail below. An inverted repeat on the 5 'or 
"left-hand" side of a nucleic acid fragment of this 
embodiment typically comprises a direct repeat (i.e., a 
5 left outer repeat) , an intervening region, and a second 
direct repeat (i.e., a left inner repeat). An inverted 
repeat on the 3' or "right-hand" side of a nucleic acid 
fragment of this embodiment comprises a direct repeat (i.e. 
a right inner repeat) , an intervening region, and a second 
10 direct repeat (i.e. a right outer repeat). 

Because these are inverted with respect to 
each other on the nucleic acid fragment, the direct repeats 
in the 5 ' inverted repeat of the nucleic acid fragment 
are in a reverse orientation compared to the direct repeats 
in the 3' inverted repeat of the nucleic acid fragment. 
The intervening region within an inverted repeat is 
generally at least about 150 base pairs in length, 
preferably at least about 160 base pairs in length. The 
intervening region is preferably no greater than about 
200 base pairs in length, more preferably no greater than 
about 18 0 base pairs in length. The nucleotide sequence 
of the intervening region of one inverted repeat may or 
may not be similar to the nucleotide sequence of an 
intervening region in another inverted repeat. 

Most transposons have perfect inverted 
repeats, whereas the inverted repeats that bind SB protein 
generally have at least about 80% identity to a consensus 
30 direct repeat, preferably about 90% identity to a 
consensus direct repeat. Preferable consensus direct 
repeats, include but are not limited to, sequences such 
as SB transposon sequence and the like. 
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Sites which are hypothesized to be the core 
binding site of the SB protein, are present in the 123 
amino acids from the N-terminus of SEQ ID NO: 3. 
5 Nucleotide identity is determined by homology search 
between the sequence set forth in SEQ ID NO: 1 or the like 
and the direct repeat. 

Examples of direct repeat sequences that bind 
10 to SB protein include but are not limited to those sequence 
of SB transposon. An example of a left outer repeat 
includes, but is not limited to: 5' 

gttgaagtcggaagtttacatacacttag-3 ' (SEQ ID NO: 21). An 
example of a left inner repeat includes, but is not limited 
15 to: 5 • -cagtgggtcagaagtttacatacactaagg-3 • (SEQ ID NO: 28). 
Examples of a right inner repeat includes, but is not 
limited to those similar to left inner repeat (for example, 
having identity of 90 % or greater) . Examples of right 
outer repeat include, but are not limited to those similar 
20 to left outer repeat sequence (for example having identity 
of 90 % or greater) . 

The inverted repeat may include the poly (A) 
signal AATAAA. This poly (A) signal can be utilized by 

25 a coding sequence present in the nucleic acid fragment 
to result in addition of a poly (A) tail to an mRNA. The 
addition of a poly (A) tail to an mRNA typically results 
in increased stability of that mRNA relative to the same 
mRNA without the poly (A) tail. Preferably, the inverted 

30 repeat is present on the 3' of a nucleic acid fragment 
that comprises two direct repeats in each inverted repeat 
sequence . 
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The inverted repeat will be flanked by a 
nucleic acid sequence to be inserted in the DNA in a cell. 
The nucleic acid sequence may comprise the entire or 
partial open reading frame of the gene (i.e., a portion 
5 of the gene encoding a protein) , alone or with one or more 
expression regulatory sequences (i.e. regulatory 
sequence in the nucleic acid) with the entire or partial 
of the open reading frame. Preferable expression 
regulatory sequences include those elements including, 
10 but not limited to promoters, enhancers, boarder 
regulatory elements, genetic locus regulatory regions or 
silencers. In a preferable embodiment, the nucleic acid 
sequence includes a promoter operably linked to a portion 
of the open reading frame. 

15 

Transposons may transpose from the first 
location to the second location on the DNA in the presence 
of transposase (also called mobility) . Any mobile 
cut-and-paste type transposon includes two basic elements, 
20 and the origin of active transposase and a mobile DNA 
sequence which is recognized by the transposase. 
Mobility of the DNA sequence allows transposition of 
intervening nucleic acids between the recognized DNA 
sequences. 

25 

As used herein the term "foreign gene" refers 
to a gene which is intended to be introduced via gene 
transfer of the present invention or a nucleic acid 
molecule encoding the same. Such a foreign gene is 
30 derived from a host of different origin from the host which 
the introduction is intended or the same host- As long 
as introduction is intended, the nucleic acid sequence 
encoding the foreign gene may encode any protein. In one 
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embodiment the protein encoded by the nucleic acid 
sequence is a marker protein such as GFP^ chloramphenicol 
acetyltransf erase (CAT), p-galactosidase (lacZ) , and 
luciferase (LUC) . In another embodiment, the protein 
5 encoded by the nucleic acid is a growth hormone, for 
example to promote growth in a transgenic animal, or 
insulin-like growth factors (IGFs) . Alternatively, the 
foreign gene may not encode a protein. 

In one embodiment of a transgenic 
animal, the protein encoded by the nucleic acid fragment 
is a product for isolation from a cell. Transgenic 
animals as bioreactors are known. Protein can be produced 
in quantity in milk, urine, blood or eggs. Promoters are 
known that promote expression in milk, urine, blood or 
eggs and these include, but are not limited to, the casein 
promoter, the mouse urinary protein promoter, beta-globin 
promoter and the ovalbumin promoter, respectively. 
Recombinant proteins are produced by means of other 
methods for producing a protein in a cell. Nucleic acids 
encoding these or other proteins can be incorporated into 
the nucleic acid fragment of this invention and introduced 
into a cell. Efficient incorporation of the nucleic acid 
fragment into the DNA of a cell occurs when a composition 
of the present invention is present. Where the cell is 
part of a tissue or part of a transgenic animal, large 
amounts of recombinant protein can be obtained. 

(Cells and Biology) 
30 The term ^^cell" is herein used in its broadest 

sense in the art, referring to a structural unit of tissue 
of a multicellular organism, which is capable of self 
replicating, has genetic information and a mechanism for 
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expressing it, and is surrounded by a membrane structure 
which isolates the living body from the outside. Cells 
used herein may be either naturally-occurring cells or 
artificially modified cells (e.g., fusion cells, 
5 genetically modified cells, etc.). Cells used herein may 
be either naturally-occurring cells or artificially 
modified cells (e.g., fusion cells, genetically modified 
cells, etc.), as long as the cell has a chemical receptor 
or is capable of having such a nucleic acid molecule 

10 introduced therein. Examples of cell sources include, 
but are not limited to, a single-cell culture; the embryo, 
blood, or body tissue of normally-grown transgenic animal; 
a mixture of cells derived from normally-grown cell lines; 
and the like. Preferably, a cell which is easily 

15 transformed or transfected is used. Cells used in the 
present invention are preferably cells which easily 
introduces a nucleic acid thereinto. 

Cells used herein may be derived from any 
20 organism (e.g., any unicellular organisms (e.g., bacteria 
and yeast) or any multicellular organisms (e.g., animals 
(e.g., vertebrates and invertebrates), plants (e.g., 
monocotyledons and dicotyledons, etc.)). For example, 
cells used herein are derived from a vertebrate (e.g., 
25 Myxinif ormes , Pe tronyzoni forme s , Chondrichthyes , 

Osteichthyes , amphibian, reptilian, avian, mammalian, 
etc.), more preferably mammalian (e.g., monotremata, 
marsupialia, edentate, dermoptera, chiroptera, carnivore, 
insectivore, proboscidea, perissodactyla, artiodactyla, 
30 tubulidentata, pholidota, sirenia, cetacean, primates, 
rodentia, lagomorpha, etc.). In one embodiment, cells 
derived from Primates (e.g., chimpanzee, Japanese monkey, 
human) are used. Particularly, without limitation, cells 
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derived from a human are used. The above-described cells 
may be either stem cells or somatic cells . Also, the cells 
may be adherent cells, suspended cells, tissue forming 
cells, and mixtures thereof. 

5 

Any organ may be targeted by the present 
invention. A tissue or cell targeted by the present 
invention may be derived from any organ. As used herein, 
the term ^'organ'' refers to a morphologically independent 

10 structure localized at a particular portion of an 
individual organism in which a certain function is 
performed. In multicellular organisms (e.g., animals, 
plants) , an organ consists of several tissues spatially 
arranged in a particular manner, each tissue being 

15 composed of a number of cells. An example of such an organ 
includes an organ relating to the vascular system. In one 
embodiment, organs targeted by the present invention 
include, but are not limited to, skin, bloodvessel, cornea, 
kidney, heart, liver, umbilical cord, intestine, nerve, 

20 lung, placenta, pancreas, brain, peripheral limbs, retina, 
and the like. In plant, ^^organ'' includes, but is not 
limited to: callus,, root, caulome, stem, stalk, leaf, 
flower, seed, embryo, germ, fruit, albumen and the like. 

25 As used herein, the term ^''tissue'' refers to 

an aggregate of cells having substantially the same 
function and/or form in a multicellular organism. 
^'Tissue" is typically an aggregate of cells of the same 
origin, but may be an aggregate of cells of different 

30 origins as long as the cells have the same function and/or 
form. Therefore, when stem cells of the present invention 
are used to regenerate tissue, the tissue may be composed 
of an aggregate of cells of two or more different origins. 
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Typically, a tissue constitutes a part of an organ. 
Animal tissues are separated into epithelial tissue, 
connective tissue, muscular tissue, nervous tissue, and 
the like, on a morphological, functional, or developmental 
5 basis. Plant tissues are roughly separated into 
meristematic tissue and permanent tissue according to the 
developmental stage of the cells constituting the tissue. 
Alternatively, tissues may be separated into single 
tissues and composite tissues according to the type of 
10 cells constituting the tissue. Thus, tissues are 
separated into various categories. 

As used herein, the term ^^stem cell'' refers 
to a cell capable of self replication and pluripotency - 

15 Typically, stem cells can regenerate an injured tissue. 
Stem cells used herein may be, but are not limited to, 
embryonic stem (ES) cells or tissue stem cells (also called 
tissular stem cell, tissue-specif ic stem cell, or somatic 
stem cell) . Accordingly, a stem cell may be directly used 

20 in the present invention . 

As used herein, the term ^^somatic cell" refers 
to any cell other than a germ cell, such as an egg, a sperm, 
or the like, which does not transfer its DNA to the next 
25 generation. Typically, somatic cells have limited or no 
pluripotency. Somatic cells used herein may be 
naturally-occurring or genetically modified. 

As used herein, the term ^^isolated" means that 
30 naturally accompanying material is at least reduced, or 
preferably substantially completely eliminated, in 
normal circumstances. Therefore, the term ^^isolated 
cell'' refers to a cell substantially free from other 
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accompanying substances (e.g., other cells, proteins, 
nucleic acids, etc.) in natural circumstances. The term 
^^isolated" in relation to nucleic acids or polypeptides 
means that, for example, the nucleic acids or^ the 
5 polypeptides are substantially free from cellular 
substances or culture media when they are produced by 
recombinant DNA techniques; or precursory chemical 
substances or other chemical substances when they are 
chemically synthesized. Isolated nucleic acids are 
10 preferably free from sequences naturally flanking the 
nucleic acid within an organism from which the nucleic 
acid is derived (i.e., sequences positioned at the 5' 
terminus and the 3' terminus of the nucleic acid) . 

15 As used herein, the term ^^established" in 

relation to cells refers to a state of a cell in which 
a particular property (pluripotency) of the cell is 
maintained and the cell undergoes stable proliferation 
under culture conditions- Therefore, established stem 

20 cells maintain pluripotency. In the present invention, 
such an established cell is preferably used since such 
a cell provides a stabilized result. 

As used herein, the term ^differentiated 
25 cell" refers to a cell having a specialized function and 
form (e.g., muscle cells, neurons, etc.). Unlike stem 
cells, differentiated cells have no or little pluripotency. 
Examples of differentiated cells include epidermal cells, 
pancreatic parenchymal cells, pancreatic duct cells, 
30 hepatic cells, blood cells, cardiac muscle cells, skeletal 
muscle cells, osteoblasts, skeletal myoblasts, neurons, 
vascular endothelial cells, pigment cells, smooth muscle 
cells, fat cells, bone cells, cartilage cells, and the 
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like . 

As used herein the term "a living body" refers 
to a form of a biological organism which may be present 
5 as a single individual capable of existing as a living 
organism. 

(Biochemistry and Molecular Biology) 
As used herein, the term ^^gene'' refers to an 

10 element defining a genetic trait. A gene is typically 
arranged in a given sequence on a chromosome. A gene which 
defines the primary structure of a protein is called a 
structural gene. A gene which regulates the expression 
of a structural gene is called a regulatory gene (e.g.,- 

15 promoter) . Genes herein include structural genes and 
regulatory genes unless otherwise specified. Therefore^ 
the term ^^cyclin gene" typically includes the structural 
gene of cyclin and the promoter of cyclin. As used herein^ 
^^gene" may refer to ^^polynucleotide", ^^oligonucleotide", 

20 ^^nucleic acid", and ^^nucleic acid molecule" and/or 
^^protein", ^^polypeptide", ^^oligopeptide" and ^^peptide". 
As used herein, ^^gene product" includes ^^polynucleotide", 
^^oligonucleotide", ^^nucleic acid" and ^^nucleic acid 
molecule" and/or ^^protein", ^^polypeptide", 

25 ^^oligopeptide" and ^""peptide", which are expressed by a 
gene. Those skilled in the art understand what a gene 
product is, according to the context. Accordingly, gene 
used herein usually includes not only double-stranded DNA 
but also each single-stranded DNA such as sense chain and 

30 antisense chain constituting thereof. Therefore, the 
genes of the present invention include any of 
double-stranded DNA including human genome DNA, and 
single-stranded DNA (sense chain) including cDNA, as well 
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as a single stranded DNA (antisense) having a sequence 
complementary to the sense chain, as well as fragments 
thereof. 

5 As used herein, the term ^^homology'' in 

relation to a sequence (e.g., a nucleic acid sequence, 
an amino acid sequence, etc.) refers to the proportion 
of identity between two or more gene sequences . Therefore, 
the greater the homology between two given genes is, the 

10 greater is the identity or similarity between their 
sequences. Whether or not two genes have homology is 
determined by comparing their sequences directly or by 
a hybridization method under stringent conditions. When 
two gene sequences are directly compared with each other, 

15 these genes have homology if the DNA sequences of the genes 
have representatively at least 50% identity, preferably 
at least 70% identity, more preferably at least 80%, 90%, 
95%, 96%, 97%, 98%, or 99% identity with each other. As 
used herein, the term ^^similarity" in relation to a 

20 sequence (e.g., a nucleic acid sequence, an amino acid 
sequence, or the like) refers to the proportion of identity 
between two or more sequences when conservative 
substitution is regarded as positive (identical) in the 
above-described homology. Therefore, homology and 

25 similarity differ from each other in the presence of 
conservative substitutions. If no conservative 

substitutions are present, homology and similarity have 
the same value. 

30 The similarity, identity and homology of 

amino acid sequences and base sequences are herein 
compared using FASTA with the default parameters. 
Alternatively, an identity search may be conducted, for 
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example, using NCBI ' s BLAST 2 . 2 . 9 (published May 12, 2004). 
As used herein, the value of identity usually refers to 
the value as a result of alignment with the BLAST as 
described above using the default parameter. If the 
5 change of parameters results in higher values, then the 
highest value is employed herein as the value of the 
identity. When a plurality of regions are evaluated for 
identity, the highest value is employed herein as the value 
of the identity. 

10 

The terms ^^protein", ^''polypeptide'', 
^'oligopeptide" and ^'peptide'' as used herein have the same 
meaning and refer to an amino acid polymer having any 
length. This polymer may be a straight, branched or 

15 cyclic chain. An amino acid may be a. naturally-occurring 
or non-naturally-bccurring amino acid, or a variant amino 
acid. The term may include those assembled into a 
composite of a plurality of polypeptide chains. The term 
also includes a naturally-occurring or artificially 

20 modified amino acid polymer. Such modification includes, 
for example, disulfide bond formation, glycosylation, 
lipidation (acylation) , acetylation, phosphorylation, or 
any other manipulation or modification (e.g., conjugation 
with a labeling moiety) . This definition encompasses a 

25 polypeptide containing at least one amino acid analog 
(e.g.^ non-naturally-occurring amino acid, etc.), a 
peptide-like compound (e.g. , peptoid) , and other variants 
known in the art. Gene products, such as extracellular 
matrix proteins (e.g., fibronectin, etc.), are usually 

30 in the form of polypeptide, however, there may be a form 
of a polypeptide variant as long as it has the same function. 
Polypeptides having specific amino acid sequences include 
fragments, cognates, derivatives and variants thereof. 
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The terms ^^polynucleotide 

^^oligonucleotide", ^^nucleic acid molecule" and ^^nucleic 
acid" as used herein have the same meaning and refer to 
5 a nucleotide polymer having any length. This term also 
includes an ^^oligonucleotide derivative" or a 
^^polynucleotide derivative" . An ^^oligonucleotide 

derivative" or a ^''polynucleotide derivative" includes a 
nucleotide derivative, or refers to an oligonucleotide 

10 or a polynucleotide having linkages between nucleotides 
different from typical linkages, which are 
interchangeably used. Examples of such an 

oligonucleotide specifically include 

2' -0-methyl-ribonucleotide, an oligonucleotide 

15 derivative in which a phosphodiester bond in an 
oligonucleotide is converted to a phosphorothioate bond, 
an oligonucleotide derivative in which a phosphodiester 
bond in an oligonucleotide is converted to a N3'-P5' 
phosphoroamidate bond, an oligonucleotide derivative in 

20 which a ribose and a phosphodiester bond in an 
oligonucleotide are converted to a peptide-nucleic acid 
bond, an oligonucleotide derivative in which uracil in 
an oligonucleotide is substituted with C-5 propynyl uracil, 
an oligonucleotide derivative in which uracil in an 

25 oligonucleotide is substituted with C-5 thiazole uracil, 
an oligonucleotide derivative in which cytosine in an 
oligonucleotide is substituted with C-5 propynyl cytosine, 
an oligonucleotide derivative in which cytosine in an 
oligonucleotide is substituted with phenoxazine-modif led 

30 cytosine, an oligonucleotide derivative in which ribose 
in DNA is substituted with 2'-0-propyl ribose, and an 
oligonucleotide derivative in which ribose in an 
oligonucleotide is substituted with 2' -methoxyethoxy 
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ribose. Unless otherwise indicated, a particular nucleic 
acid sequence also implicitly encompasses 
conservatively-modified variants thereof (e.g. 
degenerate codon substitutions) and complementary 
5 sequences, as well as the sequence explicitly indicated. 
Specifically, degenerate codon substitutions may be 
produced by generating sequences in which the third 
position of one or more selected (or all) codons is 
substituted with mixed-base and/or deoxyinosine residues 

10 (Batzer et al . , Nucleic Acid Res . 19: 5081 (1991); Ohtsuka 
etal., J. Biol. Chem. 260:2605-2608 (1985); Rossolini 
et al. , Mol. Cell. Probes 8: 91-98 (1994) ) . A gene encoding 
an extracellular matrix protein (e.g., fibronectin, etc.) 
or the like is usually in the form of polynucleotide. A 

15 molecule to be transfected is in the form of 
polynucleotide . 



As used herein amino acids may be referred to 
with the generally known three-letter abbreviation or the 
20 one letter-abbreviation proposed by the lUPAC-IUB 
Biochemical Nomenclature Commission. Nucleotides may 
also be referred to with the generally known one-letter 
abbreviations which are generally accepted. 



25 The letter codes are as follows: 
Amino Acids: 
3-letter single-letter reference 

Ala A alanine 

Cys • C cysteine 

30 Asp D aspartic acid 

Glu E glutamic acid 

Phe F phenylalanine 

Gly G glycine 



56 



KJ007 



His 


H 




lie 


I 


JL 0 w -L. v7 U w ^ 


Lys 


K 


lys ine 


T .01 1 


T. 






M 


L.I1 J-V^Il-LXlc^ 


7\ c? "m 
rAo 1 1 


M 

IM 


o.opa.x <a.g xne 


Pjto 


P 


T> T*o 1 "i n 

X N>y ^ J- X 1 


Gin 




_I_ L. C^J.LL_l_ 1 1 V.« 


Arg 


R 


airoinine 


0 vi. j- 


Q 
*j 


c? ^ y n Ti fix 
0 X X i. i vZ- 


Th T 


T 

X 


■f" Vl 0 "PI 1 Tl 0 


Val 
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valine 


Trp 
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tryptophane 


Tyr 
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tyrosine 


Asx 




asparatic acid or 


asparagine 






Glx 




glutamine or glutamic acid 


Xaa 




unknown or other amino acid 



20 Base (Nucleotide) 



abbreviation 


reference 


a 


adenine 


g 


guanine 


c 


cytosine 


t 


thymine 


u 


uracyl 


r 


guanine or adenine purine 


y 


thymine/uracil or cytosine purimidine 


m 


adenin or cytocine amino group 


k 


guanine or thymine uracil keto group 


s 


guanin or cytosine 


w 


adenine or thymine/uracil 


b 


guanine or cytocine or thymine/uracil 
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d adenine or guanine or thymine/uracil 

h adenine or cytosine or thymine/uracil 

V adenine or guanine or cytosine 

n adenine or guanine or cytosine or 



5 thymine/uracil^ unknown or other base 

As used herein, the similarity, identity and 
homology of amino acid sequences and base sequences are 
herein compared using BLAST, a sequence analysis tool, 
10 with the default parameters. 

As used herein the term ^^nucleotide'' refers 
to a nucleoside in which the sugar moiety is phosphate 
ester, and includes DNA, RNA and the like, and may be 

15 naturally occurring or non-naturally occurring. 
Nucleoside refers to a compound in which a base and a sugar 
are bound via N-glycoside bonding. '^^Nucleotide 
derivative" or ^^nucleotide analog" are interchangeably 
used herein to refer to a derivative or an analog which 

20 is different from a naturally occurring nucleotide but 
has a similar function as that of such a nucleotide- Such 
a nucleotide derivative and nucleotide analog is well 
known in the art. Examples of such a nucleotide 
derivative and nucleotide analog include, for example, 

25 but are not limited to phosphorothioate^ phosphoramidate, 
methyl phosphonate, chiral methyl phosphonate, 
2-0-methyl ribonucleotide, peptide-nucleic acid (PNA) . 
DNA includes cDNA, genomic DNA, and synthetic - DNA. 

30 In an embodiment, the variant refers to a 

naturally occurring allelic variant,. non-naturally 
occurring variant, a variant having deletion, 
substitution, addition and addition, a polynucleotide 
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sequence which does not substantially alter the function 
of the encoded polypeptide. 

In an embodiment, variation such as mutation 
5 of such amino acid sequences may occur in nature such as 
natural mutation, post-translational modification and 
the like, but also may artificially made using a naturally 
occurring gene such as specific genes of the present 
invention. 

10 

In an embodiment, the polypeptide comprises 
the allelic variants, homologs, natural variants, having 
at least 70%, preferably at least 80%, more preferably 
at least 95 %, still more preferably at least 97 % homology 
15 with the naturally occurring polypeptide. 

As used herein, the term ^^corresponding'' 
amino acid or nucleic acid refers to an amino acid or 
nucleotide in a given polypeptide or polynucleotide 

20 molecule, which has, or is anticipated to have, a function 
similar to that of a predetermined amino acid or nucleotide 
in a polypeptide or polynucleotide as a reference for 
comparison. Particularly, in the case of enzyme 
molecules, the term refers to an amino acid which is 

25 present at a similar position in an active site and 
similarly contributes to catalytic activity. For example, 
in the case of the transposon sequence for a certain 
polynucleotide, the term refers to a similar portion in 
an ortholog corresponding to a particular portion of the 

30 transposon sequence. 

As used herein, the term ^^corresponding" gene 
(e.g., a polypeptide or polynucleotide molecule) refers 
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to a gene in a given species, which has, or is anticipated 
to have, a function similar to that of a predetermined 
gene in a species as a reference for comparison. When 
there are a plurality of genes having such a function, 
5 the term refers to a gene having the same evolutionary 
origin. Therefore, a gene corresponding to a given gene 
may be an ortholog of the given gene. Therefore, genes 
corresponding to those such as murine transposon and 
murine transposase can be found in other animals. Such 

10 a corresponding gene can be identified by techniques well 
known in the art . Therefore, for example, a corresponding 
gene in a given animal can be found by searching a sequence 
database of the animal (e.g., human, rat, dog, cat) using 
the sequences such as murine transposon and murine 

15 transposase of a reference gene as a query sequence. Such 
corresponding genes can be readily obtained by those 
skilled in the art using genome databases. Methods for 
obtaining such genome sequences are well known in the art 
and described herein elsewhere. In the present invention, 

20 sequences obtained by such search can also be used. 

As used herein, the term ^^fragment" with 
respect to a polypeptide or polynucleotide refer to a 
polypeptide or polynucleotide having a sequence length 

25 ranging from 1 to n-1 with respect to the full length of 
the reference polypeptide or polynucleotide (of length 
n) . The length of the fragment can be appropriately 
changed depending on the purpose. For example, in the 
case of polypeptides, the lower limit of the length of 

30 the fragment includes 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 
25, 30, 40, 50 or more nucleotides. Lengths represented 
by integers which are not herein specified (e.g., 11 and 
the like) maybe appropriate as a lower limit. For example. 
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in the case of polynucleotides, the lower limit of the 
length of the fragment includes 5, 6, 1, 8, 9, 10, 15, 
20, 25, 30, 40, 50, 75, 100 or more nucleotides - Lengths 
represented by integers which are not herein specified 
5 Xe . g. , 11 and the like) may be appropriate as a lower limit . 
As used herein, the length of polypeptides or 
polynucleotides can be represented by the number of amino 
acids or nucleic acids, respectively. However, the 
above-described numbers are not absolute. The 

10 above-described numbers as the upper or lower limit are 
intended to include some greater or smaller numbers (e.g. , 
±10%), as long as the same function is maintained. For 
this purpose, about" may be herein put before the numbers . 
However, it should be understood that the interpretation 

15 of numbers is not affected by the presence or absence of 
about'' in the present specification. In the present 
invention, it should be understood that any fragment can 
be used as long as the fragment functions as murine 
transposon, murine transposase and the like, i.e., has 

20 transposition activity. 

As used herein, the term ^^biological 
molecule'' refers to a molecule relating to an organism 
and an aggregation thereof. 

25 

As used herein, the term ^^biological" or 
^^organism" refers to a biological organism, including, 
but being not limited to, an animal, a plant, a fungus, 
a virus, and the like. A biological molecule includes a 
30 molecule extracted from an organism and an aggregation 
thereof, though the present invention is not limited to 
this. Any molecule capable of affecting an organism and 
an aggregation thereof fall within the definition of a 
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biological molecule. Therefore, low molecular weight 
molecules (e.g., low molecular weight molecule ligands, 
etc.) capable of being used as medicaments fall within 
the definition of biological molecule as long as an effect 
5 on. an organism is intended. Examples of such a biological 
molecule include, but are not limited to, a protein, a 
polypeptide, an oligopeptide, a peptide, a polynucleotide, 
an oligonucleotide, a nucleotide, a nucleic acid (e.g., 
DNA such as cDNA and genomic DNA; RNA such as mRNA) , a 

10 polysaccharide, an oligosaccharide, a lipid, a low 
molecular weight molecule (e.g., a hormone, a ligand, an 
information transmitting substance, a low molecular 
weight c5)rganic molecule, etc.) , and a composite molecule 
thereof (glycolipids, glycoproteins, lipoproteins, etc.), 

15 and the like. A biological molecule may include a cell 
itself or a portion of tissue as long as it is intended 
to be introduced into a cell. Preferably, a biological 
molecule may include a nucleic acid (DNA or RNA) or a 
protein- In another preferred embodiment, a biological 

20 molecule is a nucleic acid (e.g., genomic DNA or cDNA, 
or DNA synthesized by PGR or the like) . In another 
preferred embodiment, a biological molecule may be a 
protein. Preferably, such a biological molecule may be 
a hormone or cytokine. 

25 

As used herein ^^chemical synthesized 
substance'' refers to any substance which may be 
synthesized using an ordinary chemical technology. 
Accordingly, any chemical synthesized substance are 
30 within chemical substances. Substantially all chemical 
substances may be synthesized. Such synthetic technology 
is well known in the art, and those skilled in the art 
can produce chemical synthesized substance appropriately 
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combining such technology. 

As used herein, the term ^^biological 
activity" refers to activity possessed by an agent (e.g. , 
5 a polynucleotide, a protein, etc.) within an organism, 
including activities exhibiting various functions (e.g., 
transcription promoting activity, etc.). For example, 
when an agent is an antisense molecule, the biological 
activity thereof includes binding to a targeted nucleic 

10 acid molecule, suppression of expression thereby and the 
like. For example, when an agent is an enzyme, the 
biological activity thereof includes the enzymatic 
activity thereof- As for another example, when an agent 
is a ligand or a receptor, binding to the receptor or the 

15 ligand corresponding to the ligand or receptor, 
respectively, is included in the biological activity 
thereof^ When the biological activity is transcriptional 
regulation activity, the activity refers to an activity 
for regulating transcriptional level or the variation 

20 thereof. For example, if an agent is a transposon, the 
biological activity thereof is transposition activity- 
Examples in which transposition activity is measured, 
include, for example, technologies described in the 
Examples, and such a biological activity may be determined 

25 by a well known technology in the art. 

As used herein, ^'polynucleotides hybridizing 
under stringent conditions'' refers to conditions commonly 
used and well known in the art. Such a polynucleotide can 
30 be obtained by conducting colony hybridization, plaque 
hybridization. Southern blot hybridization, or the like 
using a polynucleotide selected from the polynucleotides 
of the present invention. Specifically, a filter on which 
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DNA derived from a colony or plaque is iiranobilized is used 
to conduct hybridization at 65°C in the presence of 0.7 
to 1.0 M NaCl. Thereafter, a 0 . 1 to 2-fold concentration 
SSC (saline-sodium citrate) solution (1-fold 
5 concentration SSC solution composed of 150 mM sodium 
chloride and 15 mM sodium citrate) is used to wash the 
filter at 65°C . Polynucleotides identified by this method 
are referred to as ^^polynucleotides hybridizing under 
stringent conditions" . Hybridization can be conducted in 

10 accordance with a method described in, for example. 
Molecular Cloning 2nd ed. , Current Protocols in Molecular 
Biology, Supplement 1-38, DNA Cloning 1: Core Techniques, 
A Practical Approach, Second Edition, Oxford University 
Press (1995), and the like. Here, sequences hybridizing 

15 under stringent conditions exclude, preferably, 
sequences containing only A (adenine) or T (thymine) . As 
used herein, ^^hybridizable polynucleotide'' refers to a 
polynucleotide which can hybridize to other 
polynucleotides under the above-described hybridization 

20 conditions. Specifically, the hybridizable 

polynucleotide includes at least a polynucleotide having 
a homology of at least 60% to the base sequence of DNA 
encoding a polypeptide having an amino acid sequence as 
specifically set forth herein, preferably a 

25 polynucleotide having a homology of at least 80%, and more 
preferably a polynucleotide having a homology of at least 
95%. 

As used herein, the term ^^probe'' refers to a 
30 substance for use in searching, which is used in a 
biological experiment, such as in vitro and/or in vivo 
screening or the like, including, but not being limited 
to, for example, a nucleic acid molecule having a specific 
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base sequence or a peptide containing a specific amino 
acid sequence. 

Examples of a nucleic acid molecule as a 
5 common probe include one having a nucleic acid sequence 
having a length of at least 8 contiguous nucleotides, which 
is homologous or complementary to the nucleic acid 
sequence of a gene of interest. Such a nucleic acid 
sequence may be preferably a nucleic acid sequence having 
10 a length of at least 9 contiguous nucleotides, more 
preferably a length of at least 10 contiguous nucleotides, 
and even more preferably a length of at least 11 contiguous 
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includes a nucleic acid sequence having at least 70% 
homology to the above-described sequence, more preferably 
25 at least 80%, and even more preferably at least 90% or 
at least 95%. 

As used herein, the term ^^search" indicates 
that a given nucleic acid sequence is utilized to find 
30 other nucleic acid base sequences having a specific 
function and/or property either electronically or 
biologically, or using other methods. Examples of an 
electronic search include, but are not limited to, BLAST 



65 KJ007 



(Altschul et al. , J. Mol. Biol. 215:403-410 (1990)), FASTA 
(Pearson & Lipman, Proc. Natl. Acad. Sci., USA 
85:2444-2448 (1988)), Smith and Waterman method (Smith 
and Waterman, J. Mol. Biol. 147:195-197 (1981)), and 
5 Needleman and Wunsch method (Needleman and Wunsch, J. Mol. 
Biol. 48:443-453 (1970)), and the like. Examples of a 
biological search include, but are not limited to, a 
macroarray in which genomic DNA is attached to a nylon 
membrane or the like or a microarray (microassay) in which 
10 genomic DNA is attached to a glass plate under stringent 
hybridization, PGR and in situ hybridization, and the 
like. 

The term ^^highly stringent conditions" refers 

15 to those conditions that are designed to permit 
hybridization of DNA strands whose sequences are highly 
complementary, and to exclude hybridization of 
significantly mismatched DNAs . Hybridization stringency 
is principally determined by temperature, ionic strength, 

20 and the concentration of denaturing agents such as 
formamide. Examples of ^^highly stringent conditions" for 
hybridization and washing are 0.0015 M sodium chloride, 
0.0015 M sodium citrate at 65-68'*C or 0.015 M sodium 
chloride, 0.0015 M sodium citrate, and 50% formamide at 

25 42*^C. See Sambrook, Fritsch & Maniatis, Molecular Cloning : 
A Laboratory Manual (2nd ed.. Cold Spring Harbor 
Laboratory, N.Y.^ 1989); Anderson et al.. Nucleic Acid 
Hybridization: A Practical Approach Ch. 4 (IRL Press 
Limited) (Oxford Express) . More stringent conditions 

30 (such as higher temperature, lower ionic strength^ higher 
formamide, or other denaturing agents) may be optionally 
used. Other agents may be included in the hybridization 
and washing buffers for the purpose of reducing 
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non-specific and/or background hybridization. Examples 
are 0 . 1% bovine serum albumin, 0 . 1% polyvinylpyrrolidone, 
0.1% sodium pyrophosphate, 0.1% sodium dodecylsulf ate 
(NaDodS04 or SDS) , Ficoll, Denhardt's solution, sonicated 
5 salmon sperm DNA (or other non-complementary DNA) , and 
dextran sulfate, although other suitable agents can also 
be used. The concentration and types of these additives 
can be changed without substantially affecting the 
stringency of the hybridization conditions. 
10 Hybridization experiments are ordinarily carried out at 
pH 6.8-7.4; however, at typical ionic strength conditions , 
the rate of hybridization is nearly independent of pH. 
See Anderson et al.. Nucleic Acid Hybridization: A 
Practical Approach Ch. 4 (IRL Press Limited, Oxford UK) . 

15 

Agents affecting the stability of DNA duplex 
include base composition, length, and degree of base pair 
mismatch. Hybridization conditions can be adjusted by 
those skilled in the art in order to accommodate these 
20 variables and allow DNAs of different sequence relatedness 
to form hybrids. The melting temperature of a perfectly 
matched DNA duplex can be estimated by the following 
equation: 

25 Tm (°C) = 81.5 + 16.6 (log[Na'']) + 0.41 (% G+C) - 600/N 
- 0.72 (% formamide) 

where N is the length of the duplex formed, [Na"^] is the 
molar concentration of the sodium ion in the hybridization 
30 or washing solution, "% G+C is the percentage of 
(guanine+cytosine) bases in the hybrid. For imperfectly 
matched hybrids, the melting temperature is reduced by 
approximately l^C for each 1% mismatch. 
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The term ^^moderately stringent conditions" 
refers to conditions under which a DNA duplex with a 
greater degree of base pair mismatching than could occur 
5 under ^^highly stringent conditions" is able to form. 
Examples of typical ^^moderately stringent conditions" are 
0.015 M sodium chloride, 0.0015 M sodium citrate at 
50-65''C or 0 . 015 M sodium chloride, 0 . 0015 M sodium citrate, 
and 20% formamide at 37-50°C. By way of example, 
10 ^^moderately stringent conditions" of 50*^C in 0.015 M 
sodium ion will allow about a 21% mismatch. 

It will be appreciated by those skilled in the 
art that there is no absolute distinction between ^^highly 

15 stringent conditions" and ^^moderately stringent 
conditions". For example, at 0.015 M sodium ion (no 
formamide) , the melting temperature of perfectly matched 
long DNA is about 71°C. With a wash at 65°C (at the same 
ionic strength) , this would allow for approximately a 6% 

20 mismatch. To capture more distantly related sequences, 
those skilled in the art can simply lower the temperature, 
or raise the ionic strength. 

A good estimate of the melting temperature in 
25 1 M NaCl for oligonucleotide probes up to about 
20 nucleotides is given by: 

Tm= (2°C per A-T base pair) + ( 4''C per G-C base pair ) . 

Note that the sodium ion concentration in 6X 
30 salt sodium citrate (SSC) is 1 M. See Suggs et al.. 
Developmental Biology Using Purified Genes 683 (Brown and 
Fox, eds. , 1981) . 
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A naturally-occurring nucleic acid encoding 
a protein (e.g., transposon, transposase or a variant or 
fragment thereof, or the like) may be readily isolated 
from a cDNA library having PGR primers and hybridization 
5 probes containing part of a nucleic acid sequence 
indicated by, for example, SEQ ID NO. 1, 2, 10, 12, 14, 
16, 18 or the like. A preferable nucleic acid encoding 
chemical receptors, or variants or fragments thereof , or 
the like is hybridizable to the whole or part of a sequence 

10 as set forth in SEQ ID NO: 1, 2, 10, 12, 14, 16, 18 or 
the like under low stringency conditions defined by 
hybridization buffer essentially containing 1% bovine 
serum albumin (BSA) ; 500 mM sodium phosphate (NaP04) ; ImM 
EDTA; and 7% SOS at 42*^0, and wash buffer essentially 

15 containing 2xSSC (600 mMNaCl; 60 mM sodium citrate) ; and 
0.1% SDS at 50°C, more preferably under low stringency 
conditions defined by hybridization buffer essentially 
containing 1% bovine serum albumin (BSA); 500 mM sodium 
phosphate (NaP04) ; 15% formamide; 1 mM EDTA; and 7% SDS 

20 at 50°C, and wash buffer essentially containing IxSSC 
(300 mM NaCl; 30 mM sodium citrate); and 1% SDS at 50''C, 
and most preferably under low stringency conditions 
defined by hybridization buffer essentially containing 
1% bovine serum albumin (BSA) ; 200 mM sodium phosphate 

25 (NaP04) ; 15% formamide; 1 mM EDTA; and 7% SDS at 50°C, and 
wash buffer essentially containing O.SxSSC (150 mM NaCl; 
15 mM sodium citrate); and 0.1% SDS at 65''C. 

As used herein, the term ^^probe'' refers to a 
30 substance for use in searching, which is used in a 
biological experiment, such as In vitro and/or in vivo 
screening or the like, including, but not being limited 
to, for example, a nucleic acid molecule having a specific 
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base sequence or a peptide containing a specific amino 
acid sequence. 

Examples of a nucleic acid molecule as a 
5 common probe include one having a nucleic acid sequence 
having a length of at least 8 contiguous nucleotides^ which 
is homologous or complementary to the nucleic acid 
sequence of a gene of interest. Such a nucleic acid 
sequence may be preferably a nucleic acid sequence having 
10 a length of at least 9 contiguous nucleotides, more 
preferably a length of at least 10 contiguous nucleotides, 
and even more preferably a length of at least 11 contiguous 
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includes a nucleic acid sequence having at least 7 0% 
homology to the above-described sequence, more preferably 
25 at least 80%, and even more preferably at least 90% or 
at least 95%. 

As used herein, the term ^'primer'' refers to 
a substance required for initiation of a reaction of a 
30 macromolecule compound to be synthesized, in a 
macromolecule synthesis enzymatic reaction. In a 
reaction for synthesizing a nucleic acid molecule, a 
nucleic acid molecule (e.g., DNA, RNA, or the like) which 
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is complementary to part of a macromolecule compound to 
be synthesized may be used. 

A nucleic acid molecule which is ordinarily 
5 used as a primer includes one that has a nucleic acid 
sequence having a length of at least 8 contiguous 
nucleotides, which is complementary to the nucleic acid 
sequence of a gene of interest. Such a nucleic acid 
sequence preferably has a length of at least 9 contiguous 
10 nucleotides, more preferably a length of at least 10 
contiguous nucleotides, even more preferably a length of 
at least 11 contiguous nucleotides, a length of at least 
12 contiguous nucleotides, a length of at least 13 
contiguous nucleotides, a length of at least 14 contiguous 
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includes a nucleic acid sequence having at least 70% 
homology to the above-described sequence, more preferably 
at least 80%, even more preferably at least 90%, and most 
preferably at least 95%. An appropriate sequence as a 
30 primer may vary depending on the property of the sequence 
to be synthesized (amplified) . Those skilled in the art 
can design an appropriate primer depending on the sequence 
of interest- Such a primer design is well known in the 
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art and may be performed manually or using a computer 
program (e.g., LASERGENE, Primer Select, DNAStar) . 

As used herein, the term ^^epitope" refers to 
5 an antigenic determinant whose structure is clear. 
Therefore, the term ^''epitope" includes a set of amino acid 
residues which are involved in recognition by a particular 
immunoglobulin, or in the context of T cells, those 
residues necessary for recognition by T cell receptor 

10 proteins and/or Major Histocompatibility Complex (MHC) 
receptors. This term is also used interchangeably with 
^^antigenic determinant'' or ^^antigenic determinant site". 
In the field of immunology, in vivo or in vitro, an epitope 
is the feature of a molecule (e.g., primary, secondary 

15 and tertiary peptide structure, and charge) that forms 
a site recognized by an immunoglobulin, T cell receptor 
or MHC (e.g. HLA) molecule. An epitope including a 
peptide comprises 3 or more amino acids in a spatial 
conformation which is unique to the epitope. Generally, 

20 an epitope consists of at least 5 such amino acids, and 
more ordinarily, consists of at least 6, 7, 8, 9 or 10 
such amino acids. The greater the length of an epitope, 
- the more the similarity of the epitope to the original 
peptide, i.e., longer epitopes are generally preferable . 

25 This is not necessarily the case when the conformation 
is taken into account. Methods of determining the spatial 
conformation of amino acids are known in the art, and 
include, for example. X-ray crystallography and 
two-dimensional nuclear magnetic resonance spectroscopy. 

30 Furthermore, the identification of epitopes in a given 
protein is readily accomplished using techniques well 
known in the art. See, also, Geysen et al., Proc. Natl. 
Acad. Sci. USA (1984) 81: 3998 (general method of rapidly 
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synthesizing peptides to determine the location of 
immunogenic epitopes in a given antigen) ; U.S. Patent No. 
4,708,871 (procedures for. identifying and chemically 
synthesizing epitopes of antigens); and Geysen et al., 
5 Molecular Immunology (1986) 23: 709 (technique for 
identifying peptides with high affinity for a given 
antibody) . Antibodies that recognize the same epitope 
can be identified in a simple immunoassay. Thus, methods 
for determining an epitope including a peptide are well 
10 known in the art. Such an epitope can be determined using 
a well-known, common technique by those skilled in the 
art if the primary nucleic acid or amino acid sequence 
of the epitope is provided. 

15 Therefore, an epitope including a peptide 

requires a sequence having a length of at least 3 amino 
acids, preferably at least 4 amino acids, more preferably 
at least 5 amino acids, at least 6 amino acids, at least 
7 amino acids, at least 8 amino acids, at least 9 amino 

20 acids, at least 10 amino acids, at least 15 amino acids, 
at least 20 amino acids, and at least 25 amino acids. 
Epitopes may be determined by those skilled in the art 
by using commercially available kit, such as PepSet™ 
(Kurabo) . In the present invention, presenting a protein 

25 epitope playing a role in signal transduction may be used 
as a system for measuring signal transduction. 

As used herein, the term ^^agent binding 
specifically to" a certain nucleic acid molecule or 
30 polypeptide refers to an agent which has a level of binding 
to the nucleic acid molecule or polypeptide equal to or 
higher than a level of binding to other nucleic acid 
molecules or polypeptides. Examples of such an agent 



73 KJ007 



include, but are not limited to, when a target is a nucleic 
acid molecule, a nucleic acid molecule having a 
complementary sequence of a nucleic acid molecule of 
interest, a polypeptide capable of binding to a nucleic 
5 acid sequence of interest (e.g., a transcription agent, 
etc.), and the like, and when a target is a polypeptide, 
an antibody, a single chain antibody, either of a pair 
of a receptor and a ligand, either of a pair of an enzyme 
and a substrate, and the like. As used herein, such an 
10 agent specifically binding to (such as an agent 
specifically binding to calcium, an antibody against a 
specific gene product and the like) , can be used in 
measuring signal transduction. 

15 (Variation of polypeptides or polynucleotides) 

In the present invention, when using a functional 
polypeptide such as a transposon, transposase and the like, 
a variant thereof (as used herein it is called "functional 
variant") may be used as long as the variant can attain 

20 similar functions, such as transposition activity and the 
like . 

A given amino acid may be substituted with 
another amino acid in a protein structure, such as a 

25 cationic region or a substrate molecule binding site, 
without a clear reduction or loss of interactive binding 
ability. A given biological function of a protein is 
defined by the interactive ability or other property of 
the protein. Therefore, a particular amino acid 

30 substitution may be performed in an amino acid sequence, 
or at the DNA code sequence level, to produce a protein 
which maintains the original property after the 
substitution. Therefore, various modifications of 
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peptides as disclosed herein and DNA encoding such 
peptides may be performed without clear losses of 
biological usefulness . 

5 When the above-described modifications are 

designed, the hydrophobicity indices of amino acids may 
be taken into consideration. Hydrophobic amino acid 
indices play an important role in providing a protein with 
an interactive biological function, which is generally 

10 recognized in the art (Kyte, J. and Doolittle, R.F., J. 
Mol. Biol. 157 (1) : 105-132, 1982). The hydrophobic 
property of an amino acid contributes to the secondary 
structure of a protein and then regulates interactions 
between the protein and other molecules (e.g., enzymes, 

15 substrates, receptors, DNA, antibodies, antigens, etc.). 
Each amino acid is given a hydrophobicity index based on 
the hydrophobicity and charge properties thereof as 
follows: isoleucine (+4.5); valine (+4.2); leucine 
(+3-8); phenylalanine (+2.8); cysteine/cystine (+2.5); 

20 methionine (+1.9); alanine (+1.8); glycine (-0.4); 
threonine (-0.7); serine (-0.8); tryptophan (-0.9); 
tyrosine (-1.3); proline (-1.6); histidine (-3.2); 
glutamic acid (-3.5); glutamine (-3.5); aspartic acid 
(-3.5); asparagine (-3.5); lysine (-3.9); and arginine 

25 (-4.5). 

It is well known that if a given amino acid 
is substituted with another amino acid having a similar 
hydrophobicity index, the resultant protein may still have 
30 a biological function similar to that of the original 
protein (e.g., a protein having an equivalent enzymatic 
activity) . For such an amino acid substitution, the 
hydrophobicity index is preferably within ±2, more 
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preferably within ±1 , and even more preferably within ±0.5. 
It is understood in the art that such an amino acid 
substitution based on hydrophobicity is efficient. 

5 A hydrophilicity index is also useful for 

modification of an amino acid sequence of the present 
invention. As described in US Patent No. 4,554,101, 
amino acid residues are given the following hydrophilicity 
indices: arginine (+3.0); lysine (+3.0); aspartic acid 

10 (+3.0±1); glutamic acid (+3.0±1); serine (+0.3); 
asparagine (+0.2); glutamine (+0.2); glycine (0); 
threonine (-0.4); proline (-0.5+1); alanine (-0.5); 
histidine (-0.5); cysteine (-1.0); methionine (-1.3); 
valine (-1.5); leucine (-1.8); isoleucine (-1.8); 

15 tyrosine (-2.3); phenylalanine (-2.5); and tryptophan 
(-3.4). It is understood that an amino acid may be 
substituted with another amino acid which has a similar 
hydrophilicity index and can still provide a biological 
equivalent- For such an amino acid substitution, the 

20 hydrophilicity index is preferably within ±2, more 
preferably ±1, and even more preferably ±0.5. 

For example, it is well known in the art that 
the following RNA codon (therefore, in the corresponding 

25 DNA codon, T is replaced with U) , can be interchangeably 
used for encoding each of the particular amino acids: 
phenylalanine (Phe or F) : UUU or UUC; leucine (Leu or L) : 
UUA, UUG, CUU, cue, CUA or CUG; isoleucine (lie or I): 
AUU, AUG or AUA; methionine (Met or M) : AUG; valine (Val 

30 or V) : GUU, GUC, GUA or GUG; serine (Ser or S) : UCU, UCC, 
UCA, UCG, AGU or AGO; proline (Pro or P) : CUU, CCC, CCA 
or CCG; threonine (Thr or T) : ACU, ACC, ACA or ACG; alanine 
(Ala or A) : GCU, GCG, GCA or GCC; tyrosine (Tyr or Y) : 
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UAU or UAC; histidine (His or H) : CAU or CAC; glutamine 
(Gin or Q) : CAA or CAG; asparagine (Asn or N) : AAU or AAC; 
lysine (Lys or K) : AAA or AAG; asparatic acid (Asp or D) : 
GAU or GAG; glutamic acid (Glu or E) : GAA or GAG; cystein 
5 (Cys or C) : UGU or UGC; arginine (Arg or R) : CGU, CGC, 
CGA, CGG, AGA or AGO; glycine (Gly or G) : GGU, GGC, GGA 
or GGG; termination codon: UAA, UAG or UGA. Further, a 
specific DNA sequence is modified to employ a preferential 
codon for a specific cell type. For example, preferential 
10 codon usage of E. coli, is known in the art, as is the 
preferential codon usage of an animal and a human. Such 
a modification is well known in the art, and constitutes 
a part of the present invention. 

15 Variants thus produced are also within the 

scope of the present invention, and any of such variants 
are used in the present invention. 

(Antigen and antibody) 

20 As used herein, the term ^^antibody" 

encompasses polyclonal antibodies, monoclonal antibodies, 
human antibodies, humanized antibodies, polyf unctional 
antibodies, chimeric antibodies, and anti-idiotype 
antibodies, and fragments thereof (e.g., F(ab')2 and Fab 

25 fragments), and other recombinant conjugates. These 
antibodies may be fused with an enzyme (e.g., alkaline 
phosphatase, horseradish peroxidase, a-galactosidase, 
and the like) via a covalent bond or by recombination. 

30 As used herein, the term ^^monoclonal 

antibody" refers to an antibody composition having a group 
of homologous antibodies. This term is not limited by the 
production manner thereof. This term encompasses all 
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immunoglobulin molecules and Fab molecules, 
F(ab')2 fragments, Fv fragments, and other molecules 
having an immunological binding property of the original 
monoclonal antibody molecule. Methods for producing 
5 polyclonal antibodies and monoclonal antibodies are well 
known in the art, and will be more sufficiently described 
below. 

Monoclonal antibodies are prepared by using 

10 the standard technique well known in the art (e.g. , Kohler 
and Milstein, Nature (1975) 256:495) or a modification 
thereof (e.g.. Buck et al. (1982) In vitro 18:377). 
Representatively, a mouse or rat is immunized with a 
protein bound to a protein carrier, and boosted. 

15 Subsequently, the spleen (and optionally several large 
lymph nodes) is removed and dissociated into a single cell 
suspension. If desired, the spleen cells may be screened 
(after removal of nonspecif ically adherent cells) by 
applying the cell suspension to a plate or well coated 

20 with a protein antigen. B-cells that express 

membrane-bound immunoglobulin specific for the antigen 
bind to the plate, and are not rinsed away with the rest 
of the suspension- Resulting B-cells, or all dissociated 
spleen cells, are then induced to fuse with myeloma cells 

25 to form hybridomas. The hybridomas are used to produce 
monoclonal antibodies . 

As used herein, the term ^^antigen" refers to 
any substrate to which an antibody molecule may 
30 specifically bind. As used herein, the term ''^immunogen" 
refers to an antigen capable of initiating activation of 
the antigen-specific immune response of a lymphocyte. 
Accordingly, chemical receptors or products of the 
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downstream thereof may be used as an antigen or immunogen 
and uses antibody-antigen response to realize the sensor 
of the present invention. 

5 (Gene Engineering) 

As used herein, the term gene cassette" refers to 
a nucleic acid sequence comprising a DNA encoding a gene, 
a nucleic acid sequence comprising a gene promoter 
operably linked thereto (such that it can control the 

10 expression of the DNA) and a promoter, and optionally a 
heterologous gene operably linked thereto (i.e., in frame) . 
It is intended that the use of this cassette optionally 
in combination of another regulatory element is 
encompassed in the present invention. Preferably 

15 expression cassettes are those which are amenable to a 
specific restriction enzymatic digestion and feasible 
recovery. 

When a gene is mentioned herein, the term 
20 ^^vector" or ^^recombinant vector" refers to a vector 
transferring a polynucleotide sequence of interest to a 
target cell- Such a vector is capable of self-replication 
or incorporation into a chromosome in a host cell (e.g., 
a prokaryotic cell, yeast, an animal cell, a plant cell, 
25 an insect cell, an individual animal, and an individual 
plant, etc.), and contains a promoter at a site suitable 
for transcription of a polynucleotide of the present 
invention. A vector suitable for performing cloning is 
referred to as a ^^cloning vector". Such a cloning vector 
30 ordinarily contains a multiple cloning site containing 
a plurality of restriction sites. Restriction enzyme 
sites and multiple cloning sites as described above are 
well known in the art and can be used as appropriate by 
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those skilled in the art depending on the purpose in 
accordance with publications described herein (e.g., 
Sambrook et al., supra). 

5 As used herein, the term ^^expression vector''' 

refers to a nucleic acid sequence comprising a structural 
gene and a promoter for regulating expression thereof;, 
and in addition, various regulatory elements in a state 
that allows them to operate within host cells. The 

10 regulatory element may include, preferably, terminators, 
selectable markers such as drug-resistance genes, and 
enhancers. It is well known in the art that a type of an 
expression vector of a living organism such as an animal 
and a species of a regulatory element used may vary 

15 depending on the type of host cell used. 

Examples of ^^recombinant vectors'' for 
prokaryotic cells include, but are not limited to, 
pcDNA3( + ), pBluescript-SK( + /-) , pGEM-T, pEF-BOS, pEGFP , 
20 pHAT, pUC18, pFT-DEST™42GATEWAY (Invitrogen) , and the 
like . 

Examples of ^^recombinant vectors" for animal 
cells include, but are not limited to, pcDNAI/Amp, pcDNAI, 

25 pCDM8 (all commercially available from Funakoshi) , 
pAGE107 [Japanese Laid-Open Publication No. 3-229 
(Invitrogen), pAGElOB [J. Biochem. , 101, 1307(1987)], 
pAMo, pAMoA [J. Biol. Chem. , 268, 22782-22787(1993)], a 
retrovirus expression vector based on a murine stem cell 

30 virus (MSCV) , pEF-BOS, pEGFP, and the like. 

Examples of recombinant vectors for plant 
cells include, but are not limited to, pPCVICEn4HPT, 
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PCGN1548, pCGNl549, pBI221, pBI121, and the like. 

As used herein, the term ^^terminator" refers 
to a sequence which is located downstream of a 
5 protein-encoding region of a gene and which is involved 
in the termination of transcription when DNA is 
transcribed into mRNA, and the addition of a poly-A 
sequence. It is known that a terminator contributes to 
the stability of mRNA, and has an influence on the amount 
10 of gene expression. 

As used herein, the term ^^promoter" refers to 
a base sequence which determines the initiation site of 
transcription of a gene and is a DNA region which directly 

15 regulates the frequency of transcription- Transcription 
is started by RNA polymerase binding to a promoter. 
Accordingly, a portion having promoter function of a gene 
herein refers to ^^promoter moiety". A promoter region 
is usually located within about 2 kbp upstream of the first 

20 exon of a putative protein coding region. Theriefore, it 
is possible to estimate a promoter region by predicting 
a protein coding region in a genomic base sequence using 
DNA analysis software. A putative promoter region is 
usually located upstream of a structural gene, but 

25 depending on the structural gene, i.e., a putative 
promoter region may be located downstream of a structural 
gene. Preferably, a putative promoter region is located 
within about 2 kbp upstream of the translation initiation 
site of the first exon. Promoters include, but are not 

30 limited to for example, constitutive promoters, specific 
promoters and inductive promoters. 

As used herein, the term ^^enhancer" refers to 
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a sequence which is used so as to enhance the expression 
efficiency of a gene of interest. One or more enhancers 
may be used, or no enhancer may be used. 

5 As used herein, the term ^^silencer" refers to 

a sequence which has a function of suppressing and 
arresting the expression of a gene. Any silencer which 
has such a function may be herein used. No silencer may* 
be used. 

10 

As used herein, the term ^''operably linked'' 
indicates that a desired sequence is located such that 
expression (operation) thereof is under control of a 
transcription and translation regulatory sequence (e.g. , 

15 a promoter, an enhancer, and the like) or a translation 
regulatory sequence- In order for a promoter to be 
operably linked to a gene, typically, the promoter is 
located immediately upstream of the gene. A promoter is 
not necessarily adjacent to a structural gene. '^^Operably 

20 linked" also refers ,when herein used to refer to signal 
transduction, to that each signal transduction 
molecule interacts directly or indirectly via another 
molecule to contribute to the signal transduction. 

25 As used herein, technologies for introducing a 

nucleic acid molecule into a cell may be of any type, and 
include, for example, transformation, transduction, 
transfection and the like. Such a technology for 
introducing a nucleic acid molecule is well known in the 

30 art and is routinely used, and includes, for example, those 
described in Ausubel F. A. et al. ed. (1988), Current 
Protocols in Molecular Biology, Wiley, New York, NY; 
Sambrook J. et al. (1987) Molecular Cloning: A Laboratory 
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Manual^ 2nd Ed. and the third version thereof^ Cold Spring 
Harbor Laboratory Press^ Cold Spring Harbor, NY, Suppln, 
Experimental Medicine "Gene Introduction & Expression 
Analysis Experimental Procedure", Yodosha 1997. 
5 Introduction of genes may be confirmed by means of those 
described herein, such as Northern blotting. Western 
blotting analysis and other well known and routinely used 
technologies . 

Methods of introducing a vector is also 
achieved by any of the above-mentioned methods for 
introducing a DNA into a cell, and include for example, 
transf ection, transduction, transformation and the like, 
such as calcium phosphate, liposome methods, DEAE dextran 
methods, electroporation methods, particle gun methods 
(gene gun), and the like, lipofection, spheroplast Proc. 
Natl. Acad. Sci. USA, 84, 1929 (1978)], lithium acetate 
method [J. Bacteriol., 153, 163 (1983)], a method 
described in Proc. Natl. Acad. Sci. USA, 75, 1929 (1978) 
and the like. 

As used herein, the term ^^gene introduction 
reagent" refers to a reagent which is used in a gene 
introduction method so as to enhance introduction 
25 efficiency. Examples of such a gene introduction reagent 
include, but are not limited to, cationic polymers, 
cationic lipids, polyamine-based reagents, 

polyimine-based reagents, calcium phosphate, and the like. 
Specific examples of a reagent used in transfection 
30 include reagents available from various sources, such as, 
without limitation, Effectene Transfection Reagent (cat. 
no. 301425, Qiagen, CA) , TransFast™ Transf ection Reagent 
{E2431, Promega, WI), Tfx™-20 Reagent (E2391, Promega, 
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WI) , SuperFect Transfection Reagent (301305, Qiagen, CA) , 
PolyFect Transfection Reagent (301105, Qiagen, CA) , 
LipofectAMINE 2000 Reagent (11668-019, Invitrogen 
corporation, CA) , JetPEI (x4) cone. (101-30, 
5 Polyplus-transf ection, France) and ExGen 500 (R0511, 
Fermentas Inc., MD) , and the like. In the present 
invention, such a gene introduction reagent may be used 
when introducing the nucleic acid molecule of the present 
invention into a cell. 

10 

Gene introduction efficiency may be calculated by 
measuring the cell number of introduction or expressionthe 
introduced foreign substance (introduced gene) (for 
example, gene product of a reporter gene, fluorescence 
15 protein GFP and the like) per unit area (for example, 1 mm^ 
and the like) ; intensity of total signal (in case of 
fluorescence protein, fluorescence) . 

As used herein, the term ^^transf ormant" 
20 refers to the whole or a part of an organism, such as a 
cell or a tissue, which is produced by transformation. 
Examples of a transformant include a prokaryotic cell, 
yeast, an animal cell, a plant cell, an insect cell, and 
the like. Transf ormants may be referred to as transformed 
25 cells, transf ormed tissue, transf ormed hosts, or the like, 
depending on the subject, and may refer to any specific 
form depending on the context. Cells used in the present 
invention may be a transf ormant - 

30 When a prokaryotic cell is used in genetic 

engineering in the present invention, prokaryotic cells 
include the following genera: Escherichia, Serratia, 
Bacillus , Brevibacterium, Corynebacterium, 
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Microbacteriuun, Pseudomonas, for example, those species 
including Escherichia coli XLl-Blue, Escherichia coli 
XL2-Blue, Escherichia coli DHl, as examples. 
Alternatively, in the present invention, cells isolated 
5 from a naturally occurring substance may also be used- 
Animal cells which can be used in genetic 
engineering or the like herein, include murine myeloma 
cells, rat myeloma cells, murine hybridoma cells, Chinese 

10 Hamster cells including CHO cell, BHK cell, African green 
monkey kidney cells, human leukemia cells, HBT5637 (see 
Japanese Laid-Open Publication 63-299) , human colon 
cancer cell line and the like. Murine myeloma cells 
include ps20, NSO and the like; rat myeloma cells include 

15 YB2/0 and the like; human fetal kidney cells include HEK293 
(ATCC: CRL-1573) and the like; human leukemia cells 
include BALL-1 and the like; African green monkey kidney 
cells include COS-1, COS-7 and the like; human colon cancer 
cell lines include HCT-15; human neuroblastoma cells 

20 include SK-N-SH, SK-N-SH-5Y and the like; murine 
neuroblastoma cells include Neuro2A and the like as 
examples. Alternatively, the present invention may use 
a primary cultured cell. 

25 Plant cells which can be used in genetic 

engineering herein include callus, or a portion thereof 
and suspension culture cells, those cells from Solanaceae, 
Gramineae, Brassicaceae, Rosaceae, ^ Leguminosae, 
Cucurbitaceae, Lamiacea, Liliaceae, Chenopodiaceae, 

30 Apiaceae and the like. 

Gene expression (e.g., mRNA expression, 
polypeptide expression) maybe ^Metected" or ^^quantif led" 
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by an appropriate method, including mRNA measurement and 
immunological measurement. Examples of molecular 
biological measurement methods include Northern blotting 
methods, dot blotting methods, PGR methods, and the like. 
5 Examples of immunological measurement method include 
ELISA methods, RIA methods, fluorescent antibody methods. 
Western blotting methods, immunohistological staining 
methods, and the like, where a microtiter plate may be 
used. Examples of quantification methods include ELISA 

10 methods, RIA methods, and the like. A gene analysis 
method using an array (e.g. , a DNA array, a protein array, 
etc.) may be used. The DNA array is widely reviewed in 
Saibo-Kogaku [Cell Engineering] , special issue, ^^DNA 
Microarray and Up-to-date PGR Method", edited by 

15 Shujun-sha. The protein array is described in detail in 
Nat Genet. 2002 Dec; 32 Suppl : 526-32 . Examples of methods 
for analyzing gene expression include, but are not limited 
to, RT-PGR methods, RAGE methods, SSGP methods, 
immunoprecipitation methods, two-hybrid systems, in 

20 vitro translation methods, and the like in addition to 
the above-described techniques. Other analysis methods 
are described in, for example, ^^Genome Analysis 
Experimental Method, Yusuke Nakamura's Lab-Manual, 
edited by Yusuke Nakamura, Yodo-sha (2002) , and the like. 

25 All of the above-described publications are herein 
incorporated by reference. 

As used herein, the term ''''expression" of a 
gene product, such as a gene, a polynucleotide, a 
30 polypeptide, or the like, indicates that the gene or the 
like is affected by a predetermined action in vivo to be 
changed into another form. Preferably, the term 
^^expression" indicates that genes, polynucleotides, or 
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the like are transcribed and translated into polypeptides . 
In one embodiment of the present invention, genes may be 
transcribed into mRNA. More preferably, these 

polypeptides may have post-translational processing 
5 modifications. 

As used herein, the term ^^expression level" 
refers to the amount of a polypeptide or mRNA expressed 
in a subject cell. The term ''^expression level'' includes 

10 the level of protein expression of a polypeptide evaluated 
by any appropriate method using an antibody, including 
immunological measurement methods (e.g., an ELISA method, 
an RIA method, a fluorescent antibody method, a Western 
blotting method, an immunohistological staining method, 

15 and the like, or the mRNA level of expression of a 
polypeptide evaluated by any appropriate method, 
including molecular biological measurement methods (e.g. , 
a Northern blotting method, a dot blotting method, a PGR 
method, and the like) . The term ^^change in expression 

20 level" indicates that an increase or decrease in the 
protein or mRNA level of expression of a polypeptide 
evaluated by an appropriate method including the 
above-described immunological measurement method or 
molecular biological measurement method. 

25 

Accordingly, as used herein, "reduction" of 
"expression" of a gene, a polynucleotide, a polypeptide 
or the like refers to when an agent of the present invention 
is subjected to an action, the amount of expression is 
30 significantly reduced compared to that when the agent is 
not subjected to an action. Preferably, the reduction of 
expression includes a reduction of the level of 
polypeptide expression. As used herein, the "increase" 
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of "expression" of a gene, a polynucleotide, a polypeptide 
or the like refers to when an agent of the present invention 
is subjected to an action (or an agent relating to gene 
expression into a cell, for example, a gene to be expressed 
5 or an agent for regulating the same) , the amount of 
expression is significantly increased compared to when 
the agent is not subjected to an action. Preferably, the 
increase of an expression includes an increase in the level 
of polypeptide expression. As used herein, the term 

10 ^""induction" of ^^expression" of a gene refers to an increase 
in the level of expression of the gene by acting an agent 
on a cell. Accordingly, the induction of expression 
encompasses the expression of the gene when no expression 
of the gene had been observed, and the increase in the 

15 level of expression of the gene when the level of the 
expression of the gene had already been observed. 

As used herein, the term ^^specifically 
express (ing) " of a gene refers to expression in a different 

20 level (preferably in a higher level) in a specific site 
or period of time than that of the other site or period 
of time. Specific expression may refer to expression in 
a certain site (specific site) or may also refer to the 
expression including that in another site. Preferably, 

25 specific expression refers to the expression in the 
certain site only. A gene to be introduced into a 
biological organism by the present invention may be 
modified such that specific expression is achieved. 

30 As used herein, the term ^^biological 

activity" refers to activity possessed by an agent (e.g. , 
a polynucleotide, a protein, etc.) within an organism, 
including activities exhibiting various functions such 
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as transcription promoting activity. When a collage 
interacts with the ligand thereof, the biological activity 
thereof encompasses formation of a conjugate or other 
biological change. In another embodiment, such a 
biological activity may be gene transposition activity 
and the like. Gene transposition activity may be 
determined by confirming the movement of a sequence 
encoding a gene of interest by any means. For example, 
when an agent is an enzyme, the biological activity thereof 
encompasses the enzymatic activity thereof. In another 
example, when an agent is a ligand, the activity 
encompasses the binding of the ligand to the receptor 
thereof. Such a biological activity may be determined by 
any well known technology in the art (see, for example. 
Molecular Cloning, Current Protocols, which is herein 
incorporated by reference) . 

As used herein, the term ^^kit" refers to a unit 
typically comprising two or more sections which provide 
20 portions (e.g., of a reagent, a particle, a cell, a nucleic 
acid and the like) . When components are not provided as 
a mixture and are preferably mixed immediately before use, 
this form of the kit is preferable. It is advantageous 
that such a kit preferably comprises instructions 
25 describing how to treat a portion to be presented (for 
example, a reagent, a particle and the like) . Such 
instructions may be of any medium, and includes, but is 
not limited to, for example, paper-medium^ transmitting 
medium, storage medium and the like. Transmitting media 
30 include, but are not limited to the internet, intranet, 
extranet, LAN and the like. Storage media include, but 
are not limited to CD-ROM, CD-R, flexible disk, DVD-ROM, 
MD, mini-disc, MO, memory stick and the like. 
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(Transgenic biological organism) 
A general technique for producing transgenic 
mice is described in International Publication WO91-13150 
5 (Ludwig Inst. Cancer Res.). US Patent No. 4,873,191 
(Wagner et al.) teaches a mammal having an exogenous DNA, 
which was obtained by microinjection of the DNA into a 
mammalian zygote. Further, a method of efficiently 
producing mutants of an animal, a plant, or the like has 
10 been studied, in which a transposable genetic element 
(transposon) is inserted or transposed into an endogenous 
DNA so that the structure of the DNA is changed and the 
DNA is thus inactivated. Transposons have been available 
for introduction, addition, and the like of a particular 
15 gene into a chromosome. 

In addition, a variety of methods for 
producing transgenic biological organisms include, but 
are not limited to those described in, for example: 

20 M.Markkula et.al., Rev.Reprod., 1, 97-106 (1996); R. T. 
Wall et al., J. Dairy Sci., 80, 2213-2224 (1997); J. C. 
Dalton, etal.. Adv. Exp. Med. Biol., 411, 419-428 (1997); 
and H. Lubon et al., Transfus. Med. Rev., 10, 131-143 
(1996), which are herein incorporated by reference, 

25 respectively. 

In such circumstances, in the last ten years, 
analysis of transgenic (including knock-out, knock-in) 
animal via homologous recombination of embryonic stem (ES) 
30 cells is becoming of note for the purpose of analysis of 
gene functions. 



In higher biological organisms, for example. 
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efficient selection of recombinants by means of positive 
selection using the neomycin resistance gene and negative 
selection using the HSV thymidine kinase gene or the 
diphtheria toxin gene are known. Homologous recombinants 
5 are selected by PGR or Southern blotting method, i.e., 
a portion of a target gene is replaced with the neomycin 
resistance gene or the like for positive selection, and 
at the bottom thereof, targeting vectors in which the HSVTK 
gene or the like is linked for negative selection at the 
10 terminus, to introduce the same into an ES cell by 
electroporation, and selected in the presence of G418 and 
gancyclovir. The resultant colonies are isolated and 
selected for homologous recombinants by means of PGR or 
Southern blotting. 

15 

As such, a method for producing a transgenic 
(targeted gene recombination) mouse having substitution 
or disruption of a internal target gene, and having lost 
the function thereof or having altered the mutation, is 
20 useful since mutations are introduced only in the gene 
which is targeted for the analysis of the gene function . 
Use of the present invention further enhances the 
transposition efficiency by methylation and thus the 
analytical efficiency of a gene will be greatly enhanced. 

25 

After selection of a desired homologous 
recombinant, the resultant recombinant ES cell is mixed 
with a normal embryo by the blastcyst injection method 
or the collection chimeric method to produce a chimeric 
30 mouse between the ES cell and the host embryo. In the 
blastcyst injection method, ES cells are injected into 
the blastcyst by a glass pipette. In the collection 
chimeric method, the mass of ES cell and an embryo of the 
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eight-cell phase, which has a removed clear zone, are fused. 
The blastcyst with the ES cell introduced therein is 
transferred to the uterus of a pseudo pregnant surrogate 
mother to produce a chimeric mouse. Since an ES cell has 
5 totipotency, it can differentiate in to any type of cell 
including a germline cell in vivo. When a chimeric mouse 
having the germline cell derived from the ES cell and a 
normal mouse are crossbred, then a mouse having the 
chromosome of ES cell in a heterologous manner, and a 

10 transgenic mouse having the modified chromosome of ES cell 
in a homologous manner shall be obtained by crossbreeding 
the mice to each other. In order to obtain a transgenic 
mouse having the modified chromosome in a homologous 
manner from the resultant chimeric mouse, a male chimeric 

15 mouse and a female wild-type mouse is crossbred to produce 
a Fl generation of heterozygous mouse, and the resultant 
male and female heterozygous mice are crossbred and select 
homozygous mouse in the F2 generation. Whether or not a 
desired gene mutation is introduced in each generation 

20 of Fl and F2, any methods routinely used in the art such 
as Southern blotting, PGR, sequencing, and the like as 
in the assays for recombinant ES cells. 

However, the production technology of a 
25 transgenic animal being presently conducted has a defect 
in that it is difficult to selectively analyze a variety 
of gene functions. There is also the disadvantage in that 
transgenic biological organism cannot be readily 
produced - 

30 

Further, production of present transgenic 
animals requires disruption from initiation or disruption 
and replacement with respect to a desired gene after 
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identification of such a desired gene as described above. 
Thus, it requires extreme labor and time, and even those 
researchers familiar with the technology do not always 
succeed. Accordingly, it is still a labor-intensive 
5 operation. 

As such, in order to overcome the problem in 
which a variety of gene functions cannot be selectively 
analyzed, it is of note that Cre recombinase cell-type 

lO specific expression and Cre-loxP site specific 
recombination are combined. Transgenic mice using 
Cre-loxP are produced by introducing the neomycin 
resistance gene in a location, such that the expression 
of a target gene is not blocked, introducing a targeting 

15 vector into an ES cell, the vector containing the loxP 
sequence introduced therebetween in a manner such that 
the exon to be deleted is sandwiched, and isolating the 
homologous recombinant. The isolated clone is used to 
obtain a chimeric mouse, and a genetically modified mouse 

20 is produced. Next, when the mouse is cross bred with a 
transgenic mouse, which tissue-specifically expresses 
site specific recombinant enzyme Cre from PI phage of 
E. colir genes are disrupted only in the tissue expressing 
Cre (herein, Cre specifically recognizes loxP sequence 

25 (34bp) to raise recombination between the two lox P 
sequences, resulting in the disruption thereof. It is now 
possible to express Cre in an adult by crossbreeding the 
adult with a transgenic mouse having the Cre gene linked 
to an organ specific promoter or using a viral vector 

30 having the Cre gene. Further, the present invention is 
used to enhance the transposition efficiency by 
methylation, and thus the analytical efficiency of genes 
will be greatly enhanced. 
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The gene trapping (gene trap) method is of 
note as a method for analyzing a specific gene. In the 
gene trapping method, a reporter gene having no promoter 
5 is introduced into a cell, and once the gene is inserted 
into the genome in an accidental manner to express the 
reporter gene, which is used to isolate (trap) a novel 
gene. The gene trapping method is a method for efficient 
insertion mutation and identification of unknown gene, 

10 based on mouse primary embryo operation method, embryonic 
stem cell culture method and a gene targeting method via 
homologous recombination (Stanford WL., et al . , Nature 
Genetics 2:756-768(2001)). The gene trap method allows 
introduction of a gene, selection of a mutant and analysis 

15 of the phenotype with relative ease. 

In gene trap method, for example, a gene 
trapping vector having beta-geo, a fusion gene between 
lacZ and neo, has been linked between the 
20 splicing/acceptor sequence and the polyA addition signal, 
is introduced into an ES cell, and selected with G418 to 
allow selection of the clones which have accidentally 
trapped the gene expressed by the ES cell. 

25 Production of a chimeic embryo from thus a 

obtained clone, a variety of X-gal stain patterns will 
be shown depending on the expression patterns of the genes . 
As such, in the gene trapping method, unknown genes will 
be isolated and analyzed for the gene expression patterns 

30 thereof or the gene is disrupted. The present invention 
is used for enhancing transposition efficiency by 
methylation and the analytical efficiency of genes will 
also be greatly enhanced. 
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According to the present invention, a 
^^transgenic biological organism having a transposase 
gene" and a ^^transgenic biological organism containing 
5 a nonself-contained transposon" can be crossbred to obtain 
a ^'biological organism containing the transposase gene 
and the transposon'' . As used herein the term "non-self 
completing transposon" refers to one which is not 
transposable per se. The term "self-completing 

10 transposon" refers to one which can transpose per se. 
With this method, similar parents can be crossbred to 
obtain child mammals having an identical gene. This 
method makes it possible to know in advance the influence 
of introduction of only a transposon construct into a 

15 mammal on a phenotype of the mammal. Similarly, the 
method makes it possible to know in advance the influence 
of introduction of only a transposase construct into a 
mammal on a phenotype of the mammal. Alternatively, it 
is possible to obtain a ^^transgenic biological organism 

20 containing a transposase gene and a transposon" into which 
the transposase gene and the transposon are initially 
introduced without crossbreeding. This method does not 
require crossbreeding of parents, resulting in good 
efficiency in terms of labor, time, and cost. Accordingly, 

25 the present invention allows application of a production 
method for such a transgenic biological organism. 

In this ^^transgenic biological organism 
containing a transposase gene and a transposon", since 
30 the transposon is contained in a state that allows the 
transposon to be transposed, the transposon can be 
transposed on a chromosome. This transposition can 
disrupt, lower, or activate gene function at any site on 
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the chromosome. 

Further^ a ^^transgenic biological organism 
containing a transposase gene and a transposon" and a 
5 ^^biological organism containing no transposase" can be 
crossbred to obtain a ^^biological organism having a 
transposon but no transposase gene". If a transposase is 
interposed by loxPs^ a biological organism containing Cre 
may be used for crossbreeding. 

10 

In a preferable embodiment^ a ^^biological 
organism, in which substantially all cells thereof contain 
(i) at least one transposase gene and at least one 
nonself-contained transposon or self-contained 

15 transposon and (ii) at least one signature site" 
corresponds to a ^^transgenic biological organism having 
both a transposon sequence (TP) containing GFP as an 
arbitrary component and a transposase gene (SB) 
(hereinafter also referred to as a ^^TP-SB mammal") " . This 

20 transgenic biological organism is obtained by 
crossbreeding a ^^biological organism having a transposon 
sequence (TP) containing GFP as an arbitrary component 
but no transposase gene (SB) (hereinafter also referred 
to as a ^'TP (biological) organism mammal")" and a 

25 ^^biological organism having a transposase gene (SB) but 
no transposon sequence (hereinafter also referred to as 
an ^^SB (biological) organism")". 

In one embodiment, the biological organism of 
30 the present invention is induced from a stem cell or a 
fertilized egg having a transposon or a signature site. 
Therefore, essentially all cells contain a transposase 
gene. In fact, a transposon may not leave a signature site 
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when the transposon is excised and a transposon may not 
be transposed. Therefore^ item 2 describes 

^'substantially all cells". ^'Substantially all cells" is 
meant all cells except for such a particular cell(s) . In 
5 each cell of the above-described biological organism, a 
transposon is randomly transposed- For this reason, no 
uniform mutation is found in a whole individual among 
genetic mutations introduced by the transposon. 

10 In another embodiment, of the transgenic 

biological organisms of the present invention, a 
transgenic biological organism obtained by crossbreeding 
a ''TP-SB biological organism" and a "biological organism 
containing no transposase" is a transgenic biological 

15 organism in which a genetic mutation based on a signature 
sequence of the "TP-SB biological organism" has already 
been present since the fertilized egg stage and the common 
signature site is contained in substantially all cells 
of the biological organism individual. 

20 

In the present invention, a desired 
transgenic biological organism may be obtained by 
prescreening . As a prescreening method, a gene trap 
method can be used, for example ( Zambrowicz et al . , ; Nature, 
25 392:608-611 (1998); Gossler, A. et al.; Science, 
244:463-465 (1989); Skarnes, W.C. et al.; Genes Dev, 
6:903-918 (1992); and Friedrich, G. et al.; Genes Dev, 
5:1513-1523 (1991) ) . 

30 Thus, pre-screening is performed to select in 

advance transgenic biological organism desirable for 
clarification of gene function. Thereafter, 
crossbreeding over two or more generations or other 
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appropriate means can be performed to obtain a transgenic 
biological organism in which both genes on a pair of 
chromosomes are mutated. 

5 A method of analyzing the phenotype of a gene 

by disrupting the gene is an effective means for clarifying 
gene function. There are two big problems to be overcome 
in order to analyze phenotypes by exhaustive gene 
disruption for a mammal individual, particularly a mouse. 

10 The first problem is that there is no satisfactory 
technique for exhaustively disrupting genes so as to 
investigate^ gene function from phenotypes, i.e., 
so-called forward genetics- The second problem is that 
since there are a pair of genes (both alleles) , a phenqtype 

15 does not appear if only one member of the pair of genes 
is disrupted. Currently, individuals having one 
disrupted member of a pair of genes are crossbred in order 
to introduce a mutation into both alleles. In other words, 
a long time is required for crossbreeding to obtain an 

20 individual in which a mutation is introduced into both 
alleles . 

The first problem can be overcome by a 
transposon system newly developed in the present invention. 

25 The second problem can be overcome by a method of rapidly 
introducing a mutation into both alleles. As a specific 
method for overcoming the second problem, a Bloom gene 
knockout mouse, in which cells having a mutation in both 
alleles frequently appear, can be used (G. Luo et al.; 

30 Nature Genetics, 26:424-429 (2000)). Note that a perfect 
Bloom gene knockout mouse is fatal (N. Chester et al.; 
Gene and Dev., 12:3382-3393 (1998)), and therefore, the 
second problem may not be overcome . The present inventors 
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are producing a mouse, in which expression of the Bloom 
gene can be arbitrarily adjusted, using a tetOFF system 
(CT. Bond et al.; Science, 289:1942-1946 (2000)). The 
Bloom gene encodes DNA helicase. If the activity of DNA 
5 helicase is lost, sister chromatid exchange (SCE) occurs, 
and at the same time, exchange with another chromatid 
occurs. Therefore, the lack of the bloom gene causes 
recombination in tetraploids. In this case, a cell in 
which both members of a pair of genes are mutated may occur 
10 in a part of an individual. 

For example, if the Bloom gene is switched 
ON/OFF in a tetracycline dependent manner (A. Kistner et 
al.; Proc. Natl. Acad. Sci. USA, 93:10933-10938 (1996)), 

15 by adjusting the time of supplying tetracycline or the 
time of supplying no tetracycline, recombinations can be 
more frequently induced in a time-specific manner so that 
a cell having a pair of mutated genes is likely to occur. 
Therefore, a non-human mammal having a pair of mutated 

20 genes can be obtained without repetition of crossbreeding . 
As a method of introducing mutation in a time-specific 
manner, for example, pellets are continuously 
administered orally into a non-human mammal so that a fetus 
having a pair of mutated genes can be obtained. 

25 

A means for regulatably expressing the Bloom 
gene (e.g., a tetracycline regulatable unit ) is introduced 
in combination with a transposon system. For example, 
before crossbreeding, a means for regulatably expressing 
30 the Bloom gene is introduced into a fertilized egg or the 
like, into which a transposon construct, a transposase, 
a self-contained transposon, or the like is to be 
introduced. The obtained mouse having an introduced 
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transposon transposition site is treated with a means for 
inhibiting expression of the Bloom gene (e.g.^ 
administration of tetracycline) so that a genetic mutation 
obtained by a transposon system is introduced into both 
5 alleles^ thereby making it possible to rapidly determine 
phenotype. In the present invention, when no selectable 
marker gene is used, DNA may be extracted from cells of 
a non-human mammal and may be then screened by 
investigating the presence or absence of transposition 

10 by southern blotting. According to the present invention, 
it is possible to achieve efficient transposition of a 
transposon sequence in animals in vivo. Non-human 
mammals having various phenotypes can be efficiently and 
randomly obtained by a method of introducing mutation 

15 using a transposon, as compared to other methods. The 
transgenic non-human mammal of the present invention 
provides a considerably useful tool for clarifying complex 
life processes in gene function research since various 
genetic mutations can be introduced. 

20 

The frequency of expression of a transposon 
in cells is 3.5x10"^ per cell at maximum, which is 
considerably . low, as described in Proc. Natl. Acad. Sci. 
USA, vol. 95:10769-10773 (1998). In contrast, according 

25 to the present invention, the frequency of transposon 
expression among individuals, for example, was 42% of all 
mice and 80% (at maximum) of GFP gene-positive mice in 
an Example below. These values are significantly high. 
The present invention is the first to find that the 

30 transposition efficiency of a transposon was dramatically 
increased by the transposon expression system made of an 
aggregate of cells, such as animals or tissue, organs, 
and the like thereof- 
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According to one embodiment of the present 
invention, it is possible to screen transgenic non-human 
mammals having an introduced transposon construct for 
5 individuals having a randomly introduced mutation using 
a marker or other means. This is useful as means for 
clarifying gene function. For exhaustive analysis of 
gene function, it is necessary to cause a transposon to 
be transposed to a greater number of sites on a genome. 

10 According to item 2 of the present invention, it is 
possible to obtain a ^'transgenic non-human mammal, in 
which substantially all cells thereof contain (i) at least 
one transposase gene and at least one nonself-contained 
transposon or self-contained transposon and (ii) at least 

15 one signature site". For example, according to an example 
of the present application using mice, a signature site 
could be introduced at a rate of at least one per 10 cells, 
resulting in ^^seed mice" having a variety of cells in a 
mosaic pattern. Therefore, by producing mutated mice 

20 from different seed mice, it is possible to exhaustively 
introduce a mutation into substantially all genes the 
number of which is believed to be at least about 30,000. 
Therefore, in analysis of non-human mammals having 
mutations, since the present invention can achieve a 

25 considerably high level of expression frequency of genetic 
mutations, a number of functional changes by mutations 
can be simultaneously analyzed from if a single non-human 
mammal individual having a plurality of mutations is 
obtained. Thus, gene function can be highly efficiently 

30 clarified- When mice are used for introduction of 
mutations, for example, only one transgenic mouse is 
obtained from a single cell by a conventional method of 
introducing a mutation into an ES cell. The present 
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invention improves transfer efficiency by methylation^ 
and thus dramatically increases the level of gene analysis 
efficiency. 

5 On the other hand, according to the transgenic 

non-human mammal production method of the present 
invention, when a ^^transgenic non-human mammal (e.g., a 
mouse) , in which substantially all cells thereof contain 
at least one selected from the group consisting of at least 

10 one nonself -contained transposon and at least one 
signature site" is used as a seed mouse, it is possible 
for the single individual to give birth to 100,000 types 
of transgenic individuals. In other words, the present 
invention has merit in that a single mouse can give birth 

15 to offspring having a huge number of types of 
transpositions- Thus, it is possible to obtain various 
mutated individuals for clarification of life processes. 

According to the present invention, by 
20 obtaining and crossbreeding transgenic non-human mammals, 
it is possible to obtain non-human maimnals having fixed 
transpositions, which are useful for clarification of gene 
function. As used herein, ^^fixed transposition" means 
that the number of signature sites produced by 
25 transposition of a transposon is not increased due to the 
lack of an active transposase. Specifically, this 
indicates either the case where at least one signature 
site and a transposon are present but no or an inactivated 
transposase (s) is present or the case where at least one 
30 signature site is present but no transposon is present. 
If such a transgenic mammal individual is obtained, a type 
of gene function can be simply analyzed by investigating 
a corresponding single individual. It is also possible 
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to investigate an individual having a particular mutation 
about the influence of the mutation on the individual 
during the course of it's growth. Among the 
above-described non-human mammals, a non-human mammal 
5 lacking a certain function can be used to confirm a 
causative gene involved in a particular function as 
follows. For example, a transposon sequence containing 
a splice acceptor is used to add a transposase into a 
fertilized egg of the non-human mammal or crossbreed the 

10 non-human mammal with a non-human mammal having a 
transposase so as to remove the transposon sequence. It 
is then determined whether the function can be recovered 
as a result of the removal of the transposon sequence . 
In the present invention, mutations are introduced by 

15 transposons. Therefore, mutation introduction sites can 
be easily detected by an appropriate method, such as PGR 
or the like, using a signature sequence or a sequence 
derived from a transposon construct, as compared to when 
a mutation is introduced using a mutation inducing 

20 substance or the like. In an embodiment of the present 
invention, by introducing a genetic mutation into a 
non-human mammal. individual, but not culture cells, it 
is possible to analyze gene function in individuals. It 
is also possible to introduce a genetic mutation into in 

25 vivo tissue of a non-human mammal individual, which is 
difficult to handle while the non-human mammal individual 
remains alive, without external manipulation. Further, 
transposition sites differ even within the same tissue, 
so that there are genetically different cells. Therefore, 

30 the lineage of cells, such as proliferation, 
differentiation, and the like, can be systematically 
investigated in any tissue and organs, such as the blood 
system, the immune system, and the like. 
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According to the present invention, a novel 
biological organism (particularly, a mouse) of the present 
invention provides a model system useful for clarification 
5 of gene function. This embodiment of the present 
invention may provide a model system of disease for studies 
on genetic disease in in vivo animal models. In the system, 
examples of disease genes to be introduced into animal 
models include human disease causative genes, homologous 

10 genes of biological organisms with the human disease 
causative genes, full-length cDNA genes, cDNA gene 
fragments, full-length genomic DNA genes, and genomic DNA 
gene fragments. Such a disease causative gene is not 
particularly limited. Any disease causative gene can be 

15 used as long as it can be introduced into biological 
organisms and the resultant transgenic biological 
organisms can be studied as animal models of human disease. 
Human disease causative genes are preferable. According 
to one embodiment of the present invention, when a 

20 transposon containing various enhancers is transposed 
near proto-oncogenes, cancer is eventually expressed in 
cells containing these genes. Therefore, it is possible 
to perform screening for proto-oncogenes . In particular, 
when a transgenic biological organism containing a 

25 transposon sequence and a transposase gene is used, cancer 
undergoes metastasis over the whole body as well as tissue 
since proto-oncogenes are clonally expressed. At the 
same time, reduction, disruption, or activation of gene 
function due to transposition randomly proceed in each 

30 animal cell. It is expected that a plurality of cancers 
occur in the same individual. Therefore, clarification 
of gene function involved in cancer can be efficiently 
developed. Further, when a plurality of cancers are 
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confirmed in the same individual, it is possible to 
investigate whether or not cancerous cells are derived 
from the same cell by investigating whether or not the 
insertion site of a transposon vector is the same for the 
5 cancerous cells. Thus, the present invention may 
contribute to research on the mechanism of cancer 
metastasis. 

According to a third embodiment, the 
10 transgenic biological organism of the present invention 
may be used as a donor for organ transplantation. 
Examples of organs which are considered to be used as 
donors for heterograft to a human, include neurons, heart, 
lung, liver, pancreas, kidney, cornea, skin, and the like. 
15 In this case, as an introduced gene, a gene having a 
function of possibly reducing rejection or a gene having 
a function of expectably increasing acceptance are 
preferable in heterograft, for example. 

20 For production of transgenic biological 

organisms, refer also to: those references including, but 
not limited to: US patent Nos.: 5,464,764; 5,487,992; 
5,627,059; Japanese Laid-Open Publication 2001-54337; 
Gossler, A. et al . (1989), Science 244, 463-465; Wurst, 

25 W. et al. (1995), Genetics 139, 889-899; Zambrowicz, B. 
P. et al- (1998), Nature 392, 608-611 Proc.Natl. Acad. 
Sci. USA, Vol. 86, 8932-8935, 1989; Nature, Vol. 342, 
435-438, 1989; M. Muramatsu and M. Yamamoto ed. ""Jikken 
Igaku Bessatsu, Shin-tei, Idenshi Kogaku Handobukku 

30 Kaitei Daisanhan" (Experimental Medicine, Suppl. New 
Revision, Gene Engineering Handbook, Third Edition'^' (1999, 
Yodosha) , in particular, pages 239-256; S. Aizawa (1995) 
Jikken Igaku jiin taagettingu - ES saibo wo mochiita hen' i 
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mausu no sakusei" (Experimental Medicine, Gene Targeting 
- production of mutant mouse using ES cell) and the like. 

As used herein the term knock out", when 
5 referring to a gene, refers to rendering the disruption 
(deletion) of the gene or rendering function the gene 
deficient. Accordingly, the concept of knock out is 
encompassed by transgenic. 

10 As used herein, the term ^^knock-out 

biological organism" refers to a biological organism (for 
example, mouse) in which a gene is knocked out - 
Accordingly, the concept of knock-out biological organism 
is encompassed by a transgenic biological organism. 

15 

As used herein the term ^^biological organism" 
which is the object of the transgenic biological organism, 
encompasses any biological organism for which a transposon 
acts, and in which such a transgenic system can function. 
20 Such a biological organism includes, but is not limited 
to an animal, a plant, a bacteria and the like. 

As used herein the term ''''animal" refers to any 
animal, which can be targeted by the introduction of a 

25 nucleic acid sequence (preferably a foreign sequence 
encoding a gene) . Accordingly, an animal includes a 
vertebrate and invertebrate. An animal includes for 
example, mammals (for example, mouse, dos, cat, rat, 
monkey, pig, cattle, sheep, rabbit, dolphin, whale, goat, 

30 horse and the like) , birds (for example, chicken, quail 
and the like) , amphibian (for example, frog and the like) , 
reptiles, insects (for example, Drosophila and the like) , 
and the like. Preferably, an animal may be a mammal, and 
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preferably^ an animal^ which is amenable to production 
of a knock-out biological organism (for example, mouse) . 
In another preferable embodiment, an animal may be an 
animal which is known to be appropriate as a human model 
5 animal (for example, monkey). In an embodiment, an animal 
may be, but is not limited to: non-human animal or 
non-human mammal. An animal may be, for example, pig, 
monkey, cattle, horse, goat, sheep, cat, dog, rabbit, 
mouse, rat, or hamster and the like, and more preferably, 

10 mouse or rat. As used herein, the biological organism of 
the present invention, unless otherwise stated, includes 
not only mammalian individuals, but also a part of an 
individual, or organs or tissue possessed by an individual . 
These may be useful as a human disease model or a donor 

15 for organ transplantation. 

As used herein the term "plant" collectively 
refers to an organism belonging to the kingdom of Plantae 
and is typically characterized in chlorophyl, hard cell 

20 wall, presence of abundant permanent embryonal cells, and 
incapability of movement or the like. Typically, plant 
refers to Phanerogamae having formatino of cell wall an 
anabolism action by chlorophyll. "Plant" encompasses 
both monocotyledonous plant and dicotyledonous plant. 

25 Preferably plants include, but are not limited to, for 
example, monocotyledonous plants belonging to Gramineae 
such as rice, wheat, maize, barley, sorghum, and the like. 
Preferably, plant may be rice. Rice includes but is not 
limited to japonica and indica variants. More preferably, 

30 rice may be japonica variant. As used herein variants of 
rice include but are not limited to, for example, 
Nipponbare, Nihonmasari, Kinmaze, Norin No. 22, 
Chiseiasahi, Koshihikari, Akitakomachi, Dontokoi, 



107 



KJ007 



Hinohikari and the like. Indica variants include, but are 
not limited to Tetep, Basmati, IR8, Hunanzao, and the like. 
Preferable plants are not limited crops, but also flowers, 
trees, turfs, weeds and the like. Unless otherwise 
5 stated, plant refers to any of plant body, plant organ, 
plant tissue, plant cell, and seed. Examples of plant 
organs include root, leaf, stem and flower and the like. 
Examples of plant cells include callus and suspended 
culture cells. 

10 

Examples of Gramicear plants include plants 
belonging to Oryza, Hordenum, Secale, Scccharum, 
Echinochloa, or Zea, and include rice, barley, rye, 
Japanese millet, sorghum, maize and the like. 

15 

Plants used for a method for production 
according to the present invention are preferably 
monocotyledonous plant, and more preferably Gramineae 
plant- More preferably, it may be rice. 

20 

In the above-mentioned organisms, introduction 
technology of a gene includes a method selected from the 
group consisting of microinjection, a combination of a 
nucleic acid fragment and a cationic lipid vesicle or DNA 
25 aggregation reagent, and introduction of a nucleic acid 
fragment to a viral vector followed by contact with a cell 
with the virtual vector, and particle bombardment and 
electroporation . 

30 Viral vectors which may be used herein, 

include but are not limited to: retroviral vector, 
adenovirus vector, herpes virus, and adeno-associated 
vector and the like. 
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As used herein the term ^^retrovirus" refers 
to a virus which has a genetic information in the form 
of RNA, and synthesize a DNA from the information of the 
5 RNA via reverse transcripitase . Accordingly, 
^^retroviral vector" refers to a form of a retrovirus which 
is used as a vector for a gene. '''Retroviral vectors" as 
used herein include, but are not limited to, for example, 
retroviral type expression vector based on Moloney Murine 
10 Leukemia Virus (MMLV) , Murine Stem Cell Virus (MSCV) and 
the like. 

Preferably, retroviral vectors include, but 
are not limited to: pGen-, pMSCV and the like. 

15 

As used herein the term ^^gene trap (method) " 
refers to a method for identification of a gene using the 
fact that a desired cell is introduced with a reporter 
gene lack of promoter, for example, and reporter activity 

20 is only detected when the reporter gene is inserted 
downstream of a promoter in an activated form in the 
chromosome. Such a gene trap is achieved by introducing 
a ^'gene trap vector" into the host chromosome of a 
eukaryotic organism and disrupting the host gene. A gene 

25 which was introduced with a reporter gene, expresses a 
complex protein with a reporter, and thus it is capable 
of identifying a gene by monitoring the protein. 
Accordingly, a reporter gene is incorporated into the 
original locus as in the homologous recombination, it is 

30 possible to produce a complete reporter system with 
respect to the transcription regulation. By means of 
these methods, it is possible to identify a gene which 
cannot be obtained by a method for isolation of a mutants 
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via gene disruption. Accordingly^ the present invention 
can use of these gene trapping method. 

As used herein the term ^^gene trap vector" 
5 refers to a vector for selection of a vector inserted into 
a gene, using a phenomenon in which in the process of mRNA 
of a eukaryotic organism gene is matured into a mature 
mRNA, splicing mechanism is taken place. Gene trap 
vectors include, but are not limited to (1) a vector 

10 comprising a coding region of a reporter gene having no 
promoter, and a DNA sequence comprising a splice-acceptor 
sites, or (2) a vector comprising a coding region of a 
reporter gene having promoter, and a DNA comprising a 
splice-donor sites, and (3) a vector comprising the DNA 

15 sequence of both (1) and (2), and the like. 

Gene trapping vectors comprising 

splice/acceptor sequence as described above, may comprise 
polyA addition signal as necessary. A gene trapping 
20 vector comprising a splice/donor sequence may comprise 
enhancer region, and/or mRNA instability region, as 
necessary. PolyA addition signal includes, but is not 
limited to: "AATAAA" . 

25 Promoters used in the present invention 

include but are not limited to: MCI promoter, RNA pol II 
promoter and the like. 

Enhancers used in the present invention 
30 include but are not limited to polyoma viral enhancer 
(PYF441) and the like. 

Splice donor sequence used in the present 
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invention include but are not limited to murine hprt gene 
exon 8 splice donor. 

Splice acceptor sequence used in the present 
5 invention include^ but are not limited to human bcl-2 gene 
exon 3 splice acceptor. 

As used herein the term "reporter" molecule 
or "reporter" gene refer to a molecule (e.g. polypeptide) 

10 or gene which can be used as an indicative of gene 
expression in a cell. Such a molecule may be of known 
reporter protein, and includes, but is not limited to, 
for example, chloramphenicol acetyl transferase (CAT) , 
beta-glucuronidase (GUS) , beta-D~galactosidase, 

15 lucif erase, green fluorescence protein (GFP) , or aequorin 
and the like As used herein, a method for introducing 
a gene per se may be achieved by means of desired material 
using known technology in the art. In such a case, for 
example, an embryonic stem cell of interest was introduced 

20 with a reporter gene free of a promoter (e.g., luciferase, 
green fluorescence gene, beta-galactosidase gene (lacZ) , 
alkaline phosphatase gene, Cre recombinase gene and the 
like) , and reporter activity will only be detected when 
inserted downstream of an activated promoter on the 

25 chromosome. Vectors used may include, for example, the 
presently mentioned reporter gene, selectable marker gene 
(e.g. , neomycin resistant gene, hygromycin resistant gene, 
puromycin resistant gene, rescue marker gene (e.g., 
ampicillin resistant gene and collicin El replication 

30 origin) and the like. Selectable marker gene is used for 
selecting a host with the vector. Rescue marker gene is 
used for rescuing a vector (see Joyner, A. L. ed. "Gene 
Targeting, 2"^^ edition" (Oxford University Press, 2000) ) . 
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Using technologies as described above^ an embryonic stem 
cell is produced. The modified embryonic stem cell has 
trapped a gene. As used herein the term "trap" refers to 
the state where an internal gene is disrupted by insertion 
5 of a trapping vector into the genome, and the gene 
disrupted by the gene is marked at the same time. 

Preparation of an oligonucleotide having a 
specific sequence may be achieve by any well known 

10 technology in the art and include, but are not limited 
to: e.g. those described in Joyner, A. L. ed. "Gene 
Targeting, 2""^ edition" (Oxford University Press, 2000) . 
Oligonucleotides are labeled as necessary with a 
fluorescence, radiolable and the like. Such a labeling 

15 method are well known in the art, and described in the 
references herein cited. 

(Screening) 

As used herein, the term ^^screening" refers 
20 to selection of a target, such as an organism, a substance, 
or the like, a given specific property of interest from 
a population containing a number of elements using a 
specific operation/evaluation method. For screening, a 
method or system of the present invention may be used. 
25 In the present invention, as a variety of transgenic 
biological organisms are produced, any nucleic acid 
molecule and a functional regulation agent may be 
screened. 

30 In the present invention, any nucleic acid 

molecules may be screened by means of a nucleic acid 
molecule, a method or a system of the present invention. 
The present invention is also intended to comprise 
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chemicals identified by the screening or the combination 
thereof. 

A transposon system according to the present 
5 invention may be used in a variety of fields . For example, 
1) the present invention is used to efficiently insert 
genetic material into a chromosome of a biological 
organism; 2) transposon is used as an insertion mutation 
agent to identify, isolate and characterize the genes 

10 relating to growth, maintenance, regulation and 
development of an organism (e.g. Kaiser et al., 1995 
"Eukaryotic transposable, elements as tools to study gene 
structure and function" Mobile Genetic Elements, IRL 
Press, pp. 69-100) ; 3) it is possible to identify, isolate 

15 and characterize the transcriptional regulatory factors 
relating to growth, maintenance, regulation and 
development of an organism (e.g., Anderson et al., 1996, 
Mol. Mar. Biol. Biotech., 5, 105-113). As an example, a 
method and system of the present invention may be used 

20 to produce a germ-free transgenic mouse. Litter-mates 
having an activated gene are crossbred to allow production 
of germ-free ascendants for biological containment or 
maximizing the growth rate. 

25 (Genetic Therapy) 

Use of the present invention includes 
incorporation of a gene for genetic therapy to a cell by 
modifying nucleic acid fragment. Such a gene is located 
under the control of a tissue-specific promoter or 

30 universal promoter, or under the control of one or more 
other expression controlling regions for expression of 
a gene in a cell requiring the gene. Genes used for genetic 
therapy include but are not limited to, for example, CFTR 
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gene for cystic fibrosis, alpha-l~antitrypsin for lung 
diseases, adenosinaminase (ADA) for immunological 
diseases. Factor IX and interleukin-2 (IL-2) for blood 
cell diseases, and tumor necrosis factor (TNF) for cancer 
5 treatment and the like. 

Gene sequence possibly used for genetic 
therapy can be obtained by searching a known database such 
as GenBank, DDBJ, EMBL and the like. 

10 

Further, the present invention may be used for 
operating or screening a library or a part thereof, 
evaluating a function of a sequence, or screening a protein 
expression, evaluating effects of a particular protein 

15 or a particular expression controlling region on a 
particular cell type. In one embodiment, libraries of 
recombinant sequences, for example, those products of 
combinatorial library or gene shuffling can be 
incorporated in to the nucleic acid fragments of the 

20 present invention to produce a library of nucleic acid 
fragment having a variety of nucleic acid sequences 
located between certain inverted repeat sequences. Next, 
this library is introduced into a cell with a transposase 
such as the SB protein as described above. 

25 

The advantages of the present invention is 
that the complete use of the advantages in the transposon 
transgenic biological organism production system which 
is free of size limitations of a mediated nucleic acid 
30 sequence located between the inverted repeat sequences. 
This is extremely enhanced by methylation. The SB protein 
is used for incorporation of a transposon of 1.3 kilobases 
(kb) to about S.Okb. The mariner transposase was used to 
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transpose about 13Jcb. There is no known limitation to the 
size of the nucleic acid sequence which can be incorporated 
into a cell of a DNA by means of SB protein. 

5 BEST MODE FOR CARRYING OUT THE INVENTION 

Hereinafter, preferable embodiments for carrying 
out the present invention are described. The embodiments 
provided below are only intended for better understanding 
10 of the present invention, and thus it should be understood 
that the scope of the present invention should not be 
limited to the description of the following section. 

In an aspect, the present invention provides 
15 an isolated nucleic acid molecule having a nucleic acid 
sequence encoding a transposon, wherein the nucleic acid 
sequence has a methylation at at least one nucleotide. 
Such a nucleic acid molecule, unexpectedly retains 
transposon activity and can be activated in the host, which 
20 are significant effects. In the present invention such 
an effect allows production of a transgenic biological 
organism for any type of organism, which is a significant 
utility. 

.25 In a preferable embodiment, the nucleic acid 

of the present invention has a nucleic acid sequence 
encoding a desired gene. As used herein such a desired 
gene is any gene, and may be appropriately selected 
depending on the application of the nucleic acid molecule 

30 to be used. 

In a preferable embodiment, it may be 
advantageous but is not limited to, that the methylation 
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in the nucleic acid molecule used in the present invention 
is present at least C in a CG sequence- Although not 
wishing to be bound by any theory, specific presence of 
methyl group at a CG sequence causes a phenomenon of 
5 heterochromatinization of the chromosome, resulting in 
rendering greater effects on the transposition efficiency 
of a transposon. 

Transposons used in the present invention may 
10 be in any form, and preferably, of DNA-type. If DNA-type 
is used, effects of methylation are amenable for 
achievement. Preferably, transposons used in the present 
invention belongs to Tcl/mariner type. Although not 
wishing to be bound by any theory, it is believed that 
15 transposons belonging to this type may heterochromatinize 
the chromosome when it is integrated into the chromosome, 
resulting in rendering greater effects on the 
transposition efficiency of a transposon. 

20 In the most preferable embodiment, the 

transposon used in the present invention comprises 
Sleeping Beauty. This particular transposon allows, as 
described elsewhere herein, utilization across the wall 
of species. 

25 

Preferably, in the nucleic acid molecule of 
the present invention, a desired gene may be operably 
linked to a transposon, or constructed in a manner such 
that it is operably linked to a transposon when introduced 
30 into a cell. In order that a transposon is operably linked, 
for example, appropriate location of an inverted repeat 
sequence and the like is performed, but it is not limited 
to this. 



116 



KJ007 



In a preferable embodiment^ the nucleic acid 
molecule of the present invention is used for introducing 
a foreign gene into a host. The introduction of a foreign 
5 gene into a host is an unexpected effect even if the nucleic 
acid molecule of the present invention has observed an 
effect in vitro. In addition, in the circumstances where 
methylation mechanisms are unclear, there is no key for 
expectation, and thus the efficient introduction of a 
10 foreign gene by methylation of the present invention is 
a significant effect. 

As used herein, the biological organism 
(host) targeted by the present invention is any biological 

15 organism as long as a transposon operates, and includes 
eukaryotes but is not limited thereto. Preferably, the 
host targeted by the present invention includes, but is 
not limited to mammals (for example, rodents such as mouse, 
rat; primates and the like) . Whether or not the functions 

20 of a transposon can be confirmed by conducting an animal 
test . 

In a preferable embodiment, with respect to 
the nucleic acid molecule of the present invention, it 
25 is advantageous that a transposase acts on the location 
on the genome for which the nucleic acid molecule is 
inserted. Such a constitution allows smooth 

transposition of a gene. 

30 In another aspect, the present invention is 

directed to a gene cassette having a nucleic acid sequence 
encoding a transposon, wherein said nucleic acid sequence 
has a methylation at at least one nucleotide. The nucleic 
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acid molecules to be included may have the features as 
described herein above. As used herein the term "gene 
cassette" may attach another element . 

5 In another aspect, the present invention 

provides a vector having a nucleic acid sequence encoding 
a transposon and a nucleic acid sequence encoding a desired 
gene, wherein said nucleic acid sequence has a methylation 
at at least one nucleotide. The sequence encoding a 
10 transposon and methylation are described herein elsewhere. 
A vector has usually circular shape but is not limited 
thereto. A vector may include elements for regulating 
transcription, translation or expression. Such an 
element is preferably operably linked. 

15 

Preferably, the vector uses introduction of 
a foreign gene into a host. The vector of the present 
invention achieved an effect which allows production of 
a transgenic biological organism of a species for which 

20 it was impossible or difficult to produce. In particular, 
in mammals which had been believed that no transgenic 
biological organisms to date, SB allows production of a 
transgenic biological organism using a form of vector, 
which means that a system in which the foreign gene can 

25 be readily inserted is readily available. Accordingly, 
such a system has great utility. 

The vector of the present invention can be 
constituted such that a transposase acts on the location 
30 of the genome for which a nucleic acid sequence is 
inserted. 

As used herein, the term ^^signature site" 
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refers to a site which emerges as a result of excision 
and transposition of a transposon. For example, when an 
SB transposon is used in the present invention, a signature 
site comprises a sequence ^'TAcagTA" or ^'TActgTA" where 
5 three bases of the terminus sequence of the transposon 
is inserted into a TA repeat of a target sequence. Note 
that even when a transposon is moved, a resultant signature 
site may not have the above-described perfect specific 
sequence. In the present invention, a site having such 
10 an imperfect sequence is regarded as a signature site. 

In another embodiment, in the present 
invention, a vector is constructed such that a transposase 
can act on the site on the genome to which the nucleic 
15 acid of the present invention is inserted. Such construct 
may be achieved by methylation of at least a portion of 
the sequence encoding a transposon in the present 
invention. 

20 In another embodiment, the present invention 

provides a composition for acting a transposase onto a 
foreign nucleic acid molecule to be inserted into the 
genome. The composition is characterized in that nucleic 
acid sequence encoding a transposon and the foreign 

25 nucleic acid molecule are included, and that the sequence 
encoding the transposon is methylated. As used herein, 
the foreign nucleic acid molecule may or may not be 
methylated 

30 In another aspect, the present invention 

provides a cell comprising a nucleic acid molecule having 
a nucleic acid sequence encoding a transposon and a nucleic 
acid sequence encoding a desired gene, wherein said 
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nucleic acid sequence has a methylation at at least one 
nucleotide. Such a cell allows ready production of a 
transgenic biological organism in the present invention. 
Accordingly, the present invention has significant 
5 utility. The nucleic acid molecule which can be 
encompassed by the cell of the present invention may have 
any sequence as long as the sequence comprises those 
encoding a transposon as described herein above. 
Preferably, such a sequence functions in the cell to be 
10 retained. Preferably, such a cell is used for introducing 
a foreign gene into a host. Such a host may preferably 
but no necessarily be of the same species as the cell. 

The cell of the present invention may of any 
15 type, and preferably include but is not limited to 
eukaryotic cells, and more preferably includes but is not 
limited to mammalian cell, and still more preferably 
includes but is not limited to rodent cells. In a more 
preferable embodiment, it is useful to use a model animal 
20 such as a mouse, rat and the like. The cell of the present 
invention should be determined in relation to the property 
of the nucleic acid molecule to be introduced, object, 
and the host into which the nucleic acid molecule is 
introduced- The nucleic acid molecule to be included in 
25 the cell of the present invention may be the vector of 
the present invention. 

In another aspect, the present invention 
provides a tissue comprising a nucleic acid sequence 
30 encoding a transposon and a nucleic acid sequence encoding 
a desired gene, wherein said nucleic acid sequence has 
a methylation at at least one nucleotide. Such a tissue 
allows ready production of a transgenic biological 
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organism in the present invention. Accordingly, the 
present invention has significant utility. The nucleic 
acid molecule which can be encompassed by the tissue of 
the present invention may have any sequence as long as 
5 the sequence comprises those encoding a transposon as 
described herein above. Preferably, such a sequence 
functions in the tissue to be retained. Preferably, such 
a tissue is used for introducing a foreign gene into a 
host. Such a host may preferably but no necessarily be 
10 of the same species as the tissue. 

The tissue of the present invention may of any 
type, and preferably includes, but is not limited to 
eukaryotic tissues, and more preferably includes, but is 

15 not limited to mammalian tissue, and istill more preferably 
includes, but is not limited to rodent tissues. In a more 
preferable embodiment, it is useful to use a model animal 
such as a mouse, rat and the like. The tissue of the 
present invention should be determined in relation to the 

20 property of the nucleic acid molecule to be introduced, 
object, and the host into which the nucleic acid molecule 
is introduced. The nucleic acid molecule to be included 
in the tissue of the present invention may be the vector 
of the present invention. 

25 

In another aspect, the present invention 
provides a biological organism comprising a nucleic acid 
sequence encoding a transposon and a nucleic acid sequence 
encoding a desired gene, wherein said nucleic acid 
30 sequence has a methylation at at least one nucleotide. 
Such a biological organism allows ready production of a 
transgenic biological organism in the present invention. 
Accordingly, the present invention has significant 
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utility. The nucleic acid molecule which can be 
encompassed by the biological organism of the present 
invention may have any sequence as long as the sequence 
comprises those encoding a transposon as described herein 
5 above. Preferably, such a sequence functions in the 
biological organism to be retained. Preferably^ such a 
biological organism is used for introducing a foreign gene 
into a host. Such a host may preferably but not 
necessarily be of the same species as the biological 
10 organism. 

The biological organism of the present 
invention may of any type, and preferably includes, but 
is not limited to eukaryotic biological organisms, and 

15 more preferably includes but is not limited to mammalian 
biological organism, and still more preferably includes 
but is not limited to rodent biological organisms. In a 
more preferable embodiment, it is useful to use a model 
animal such as a mouse, rat and the like. The biological 

20 organism of the present invention should be determined 
in relation to the property of the nucleic acid molecule 
to be introduced, object, and the host into which the 
nucleic acid molecule is introduced- The nucleic acid 
molecule to be included in the biological organism of the 

25 present invention may be the vector of the present 
invention. 

In a preferable embodiment, the biological 
organism of the present invention is preferably not 
30 derived from the biological organism from which the 
desired gene is derived. In such a case, the desired gene 
is called a foreign gene. Those to be introduced as a 
foreign gene may be of any type and may vary depending 
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on the gene of .interest. 

In another aspect, the present invention is 
related to a method for producing a transgenic biological 
5 organism, comprising the steps of: A) providing an 
isolated nucleic acid molecule having a nucleic acid 
sequence encoding a transposon; B) transforming a germ 
cell of a desired biological organism with said nucleic 
acid molecule; C) selecting an individual in which the 
10 germ cell has a methylation in a nucleic acid sequence 
encoding said transposon; and D) regenerating a biological 
organism using the transformed germ cell. 

Provision of an isolated nucleic acid 

15 molecule having nucleic acid sequence encoding a 
transposon is known in the art, and may be achieved by 
means of well known technology. Transformation of a germ 
line cell of the desired biological organism with the 
nucleic acid molecule is also well known in the art and 

20 may be achieved by any methods including those gene 
recombination technology described herein. Individuals 
having the nucleic acid sequence encoding a transposon 
methylated in a germ line cell can also be selected by 
means of any well known technology in the art. 

25 Specifically, nucleic acid molecule (for example, DNA) 
is removed from the germ line cell so as not to disrupt 
methylation, and the nucleic acid molecule is subjected 
to demethylation and confirm whether the mass is changed 
or not and optionally the sequence of the methylated 

30 portion may be determined. Regeneration of the 
biological organism using the transformed germ line cell 
may also be conducted and can be achieved depending on 
the organism used for an appropriate method. 
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In a preferable embodiment^ the biological 
organism targeted by the transgenic biological organism 
of the present invention is eukaryotic. This is because 
5 the effects of methylation of transposon in the present 
'invention is more amenable. 

In another preferable embodiment, the 
biological organism to be targeted by the transgenic 
10 biological organism of the present invention includes 
mammals. This is because the effects of methylation of 
transposon in the present invention is more amenable. 
More preferably, the mammals may be rodent, and still more 
preferably, a model animal such as mouse, rat and the like. 

15 

In another aspect, the present invention 
provides a method for producing a transgenic biological 
organism comprising the steps of: A) providing an isolated 
nucleic acid molecule having a nucleic acid sequence 
20 ^ encoding a transposon, wherein the nucleic acid sequence 
has a methylation at at least one nucleotide; B) 
transforming a germ cell of a desired biological organism 
with said nucleic acid molecule; and C) regenerating a 
biological organism using the transformed germ cell. 

25 

Provision of an isolated nucleic acid 
molecule . having nucleic acid sequence encoding a 
transposon which is methylated at at least one nucleotide 
is described elsewhere herein and sufficiently described 
30 and may be achieved by means of well-known technology. 

In the method of the present invention, 
transformation of a germ line of the desired biological 
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organism with a nucleic acid molecule is also achieved 
by means of well-known technology in the art. 

In the method of the present invention^ 
5 regeneration of the biological organism using the 
transformed germ line cell may also be readily achieved 
by those skilled in the art by selection of an appropriate 
method depending on the organism. 

10 In another preferable embodiment, the 

biological organism to be targeted by the transgenic 
biological organisms of the present invention includes 
mammals- This is because the effects of methylation of 
transposon in the present invention is more amenable. 

15 More preferably, the mammals may be rodent, and still more 
preferably, a model animal such as mouse, rat and the like. 
The present invention allows production of transgenic 
animal of a model animal in a simple and automatic method 
having high probability. As such, the present invention 

20 has unexpected significant effects which has not been 
achieved by the conventional technologies. 

In another aspect, the present invention 
provides a kit for producing a transgenic biological 
25 organism, comprising A) an isolated nucleic acid molecule 
having a nucleic acid sequence encoding a transposon, 
wherein the nucleic acid sequence has a methylation at 
at least one nucleotide; and B) a transposase. 

30 The nucleic acid molecule to be included in 

the kit of the present invention are methylated at the 
portion encoding the transposon as described above, and 
may be a naturally-occurring one or may be produced in 
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an artificial manner by means of methylation enzyme. 
Further, any transposase may be used as long as it has 
an activity to the transposon to be included in the kit. 

5 In an embodiment, the kit of the present 

invention comprises an indication describing methods of 
using the nucleic acid molecule and transposase. The 
indication may be of paper medium, but may also be of 
transmitting medium such as information on a network. The 

10 indication describes handling of nucleic acid molecules, 
transformation methods, culture methods, regeneration 
methods, transposon incubation methods and the like, and 
a variety of protocols relating to transgenic biological 
organisms . The description may be of monolingual, but two 

15 or more languages may also be described. 

In another aspect, the present invention is 
related to use of an isolated nucleic acid molecule having 
a nucleic acid sequence encoding a transposon, wherein 

20 the nucleic acid sequence has a methylation at at least 
one nucleotide for producing a transgenic biological 
organism. Such use of a methylated nucleic acid molecule 
for transgenic biological organism was not known in the 
art, and not obvious therefrom. Description relating 

25 to such a nucleic acid molecule is as described herein 
above, and a variety of modification is possible. 

In another aspect, the present invention 
provides a nucleic acid fragment comprising a nucleic acid 
30 sequence located between at least two inverted repeat 
sequences, wherein the inverted repeat sequences have the 
capability of binding to a transposase, wherein the 
nucleic acid fragment is capable of being incorporated 



126 



KJ007 



into a DNA in a cell, wherein at least one nucleotide 
thereof is methylated. Inverted repeat sequences may be 
of any type, and for example, those set forth in SEQ ID 
NOs: 22-26. The nucleic acid sequence maybe of any type 
5 and include for example, a portion or the entire part of 
a foreign gene. Preferably, the nucleic acid sequence 
comprises at least one expression regulatory region. 
Such an expression regulatory region includes, but is not 
limited to for example, promoters, enhancers, or silencers 
10 and the like. 

In an embodiment, the nucleic acid fragment 
of the present invention further includes at least a 
portion of a foreign gene, wherein the nucleic acid 
15 sequence included in the nucleic acid fragment is operably 
linked to a sequence encoding at least a portion of the 
foreign gene. 

In an embodiment, the cell may be of any type, 
20 and preferably of an animal. In a preferable embodiment, 
cells used in the present invention is of vertebrate (for 
example, mammals such as primate or rodent (for example, 
rat, mouse and the like) ) . 

25 DNA of the cell used herein may be DNA present 

in any cell such as cellular genome, episome and plasmid 
and the like. 

At least one inverted repeat sequence used in 
30 the present invention may comprise sequence set forth in 
SEQ ID NO: 20 or 21 or a portion thereof. 

In one embodiment, the transposase used in the 
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present invention is SB protein. Transposases used in the 
present invention may have at least 80 %, preferably at 
least 90 %, more preferably at least 95 % amino acid 
homology to SEQ ID NO: 3. Alternatively, the nucleic acid 
5 molecule encoding the transposase used in the present 
invention is a sequence hybridizable to SEQ ID NO: 2 under 
stringent conditions, or hybridizable to the nucleic acid 
sequence encoding the amino acid sequence set forth in 
SEQ ID NO: 3 under stringent conditions. Alternatively, 
10 the transposase used in the present invention may have 
one or more amino acid substitutions, additions or 
deletions against the amino acid sequence set forth in 
SEQ ID NO: 3.. 

15 In a preferable embodiment, the at least one 

inverted repeat sequence included in the nucleic acid 
fragment of the present invention includes at least one 
direct repeat sequence, and the direct repeat sequence 
includes the nucleotide sequence set forth in SEQ ID NO: 

20 26 or at least 80 % homologous nucleotide sequence thereto. 

In another preferable embodiment, the at least one 
inverted repeat sequence included in the present invention 
comprises at least one direct repeat sequence, and 
25 preferably the direct repeat sequence may consist of the 
nucleic acid sequence set forth in or selected from the 
group consisting of SEQ ID NO: 22-25. 

In another aspect, the present invention 
30 provides a nucleic acid introduction system for 
introducing into a DNA of a cell another DNA, the system 
comprising A) a nucleic acid fragment comprising a nucleic 
acid sequence located between at least two inverted repeat 
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sequences, wherein the inverted repeat sequences have the 
capability of binding to a transposase, wherein the 
nucleic acid fragment is capable of being incorporated 
into a DNA in a cell, wherein at least one nucleotide 
5 thereof is methylated; and B) a transposase or a nucleic 
acid encoding a transposase. 

In a preferable embodiment, the transposase 
used in the system of the present invention may be SB 

10 nucleic acid molecule or SB protein, but is not limited 
thereto, and it is understood that any transposase used 
in the nucleic acid fragment of the present invention may 
also be used in the system. Alternatively, the 
transposase used in the present invention may have the 

15 amino acid sequence set forth in SEQ ID NO: 3 or a variant 
thereof, or the nucleic acid sequence encoding the 
transposase may be the .nucleic acid sequence set forth 
in SEQ ID NO: 2 or a variant thereof. 

20 As used herein the nucleic acid encoding the 

transposase used in the present invention may preferably 
incorporated into the cellular genome - 

In an embodiment, the system of the present 
25 invention further comprises a plasmid or viral vector^, 
and the plasmid or the viral vector includes the nucleic 
acid fragment as a part thereof. 

The nucleic acid fragment included in the 
30 system of the present invention may include at least a 
portion of the sequence encoding a foreign gene. 



In a preferable embodiment, the nucleic acid 
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fragment included in the system of the present invention 
is introduced into the cell by means of a method selected 
from the group consisting of particle bombardment; 
electroporation; microinjection; use of gene 
5 introduction reagents; and use of virus vectors. It 
should be understood that particle bombardment; 
electroporation; microinjection; use of gene 
introduction reagents; and use of virus vectors and the 
like are known in the art and any known technology may 
10 be used. 

In another aspect, the present invention 
provides a method for producing a transgenic biological 
organism, comprising the steps of: 

15 introducing a nucleic acid fragment and transposase 

into a pluripotent cell, wherein the nucleic acid fragment 
comprising a nucleic acid sequence located between at 
least two inverted repeat sequences, wherein the inverted 
repeat sequences have the capability of binding to a 

20 transposase, wherein the nucleic acid fragment is capable 
of being incorporated into a DNA in a cell, wherein at 
least one nucleotide thereof is methylated; and growing 
the cell into a living body. As used herein the 
introduction of the nucleic acid fragment and the 

25 transposase into a cell, may be achieved by means of any 
well-known technology of introducing a nucleic acid or 
a peptide. It should be understood that a method for 
growing a cell into a biological organism may also be 
achieved by means of an appropriate method such as the 

30 use of differentiation agents. Alternatively, it is 
understood that a pluripotent cell such as those selected 
from the group consisting of oocyte, embryonic cell, egg 
and stem cell may be used for obtaining a biological 
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organism by growing the same under a normal condition. 

Preferably, the biological organism used may 
be a rodent such as mouse or rat and a primate. 

5 

In another aspect, the present invention 
provides a method for introducing a nucleic acid into a 
DNA of a cell, comprising the step of: introducing a 
nucleic acid fragment into a cell, wherein the nucleic 

10 acid fragment comprises a nucleic acid sequence located 
between at least two inverted repeat sequences, wherein 
the inverted repeat sequences have the capability of 
. binding to a transposase, wherein the nucleic acid 
fragment is capable of being incorporated into a DNA in 

15 a cell, wherein at least one nucleotide thereof is 
methylated. 

In an embodiment, the present invention may 
further comprise the step of introducing a transposase 
20 into the cell. The transposase used herein may have at 
least 80 % homology to the sequence set forth in SEQ ID 
NO: 3 or any other variant. Such an example of variants 
are described and exemplified herein elsewhere. 

25 In an embodiment, the cell used in the present 

invention comprises a nucleic acid encoding transposase . 
The inclusion of such a nucleic acid avoids necessity of 
introduction of a foreign transposase. Preferably, the 
nucleic acid is incorporated in to the cellular genome. 

30 This is because it enables permanent expression thereof. 
Accordingly, such a transposase may preferably be stably 
expressed in the cell. Alternatively, the transposase 
may be operably linked to a inductive promoter in a 
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controllable manner, and temporarily expressed. It 
should be understood that stable or temporal expression 
of the nucleic acid may be appropriately selected for an 
appropriate use depending on the experiments of interest. 

5 

In the present invention, the sequence to be 
inserted preferably encodes a protein. Such a protein 
includes but is not limited to, for example, a marker 
protein, fluorescence protein, a physiological protein 
10 and the like. 

In another aspect, the present invention 
provides a method for transposing a nucleic acid sequence 
in a cell, comprising the step of: introducing a 

15 transposase into a cell comprising a nucleic acid fragment, 
wherein the nucleic acid fragment comprises a nucleic acid 
sequence located between at least two inverted repeat 
sequences, wherein the inverted repeat sequences have the 
capability of binding to a transposase, wherein the 

20 nucleic acid fragment is capable of being incorporated 
into a DNA in a cell, wherein at least one nucleotide 
thereof is methylated; wherein the transposase transposes 
the nucleic acid sequence from a first location in a DNA 
of the cell to a second location of the DNA. The 

25 transposition of the gene in the cell is a function which 
is advantageously achieved by the transposon system, and 
the methylation significantly enhances the mobility 
efficiency of the transposon system. 

30 In the transposition method of nucleic acid 

sequence of the present invention, the DNA of the cell 
is preferably a genomic DNA, and the first and second 
locations may be at epi chromosomal DNA, but the present 
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invention is not limited thereto. Epi chromosomal DNA 
involvement allows simple on-off switching of the gene. 
Transposase as used herein may preferably be introduced 
as a nucleic acid into the cell. 

5 

In another aspect, the present invention 
provides a method for identifying a gene in a cell, 
comprising the steps of: introducing into a cell a nucleic 
acid fragment comprising a nucleic acid sequence located 

10 between at least two inverted repeat sequences, wherein 
the inverted repeat sequences have the capability of 
binding to a transposase, wherein the nucleic acid 
fragment is capable of being incorporated into a DNA in 
a cell, wherein at least one nucleotide thereof is 

15. methylated, and a transposase; digesting the DNA in the 
cell with a restriction endonuclease capable of digesting 
the nucleic acid sequence; identifying the inverted repeat 
sequence; determining a sequence of a nucleic acid having 
similar sequence to the inverted repeat sequence; and 

20 comparing the sequence with sequence information in a 
sequence information database. It is understood that the 
introduction of the nucleic acid fragment and the 
transposase into a cell may be achieved by any well-known 
technology in the art. Endonucleases used herein may be 

25 of any type known in the art, and it is understood that 
the cleavage and digestion may be achieved by using, for 
example, a manual according to the manufacturer. It is 
understood that identification of an inverted repeat 
sequence may be achieved by means of identification of 

30 specific base length or restriction pattern, sequencing 
and the like. Once sequenced, it is possible to compare 
the determined sequence with a known database or a database 
which is produced by the practitioner using a method known 
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in the art- As such^ the function of a gene may be 
identified by such a comparison. It is understood that 
such identification may be achieved by, for example, a 
variety of biological assay or observation of phenotype 
5 thereof and the like. 

References such as scientific literature, 
patents, patent applications and the like will be 
incorporated herein by reference as if the entirety 
10 thereof is specifically described herein. 

Hereinafter, the present invention is 
described based on the Examples. The following Examples 
are provided only for the purpose of illustration. 
15 Accordingly, the scope of the claims of the present 
invention is not limited to -the description described 
above or following examples, but only by the appended 
claims . 

20 EXAMPLES 

Hereinafter the present invention is 
described by way of examples in detail, but the present 
invention is not limited to the following examples - 
Reagents used herein below in the Examples, are obtained 

25 from Sigma (St. Louis, USA), Wako Pure Chemical (Osaka, 
Japan) and the like unless otherwise stated. Handling of 
animals have been conducted under the provisions defined 
in Osaka University, Medical School. The method for 
producing an expression vector used in the present 

30 invention will be described in with specific examples. 
It will be readily conducted for those skilled in the art 
to replace elements, such as these start plasmid, 
promoters and the like with equivalents thereof. 
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The scope of the claims of the present 
invention is not limited to the description described 
above or following examples, but only by the. appended 
5 claims. 

(Materials and Methods) 

(Construction of plasmids and methylation) 

10 A targeting vector to introduce the SB 

transposon into the serine palmitoyltransferase 
long-chain base subunit 2 (Sptlc2) locus was generated 
by first cloning a 5-kb Xhol-Kpnl fragment of the Sptlc2 
gene containing exon 5 into the Xhol-Kpnl site of 

15 pBluescript II (pBS; Stratagene) , resulting in pBS-Sptlc2 . 
A Sall-BamHI fragment of pCX-EGFPPigA (Horie, K., 
A.Kuroiwa, M.Ikawa, M.Okabe, G.Kondoh, Y.Matsuda, and 
J.Takeda. 2001; Proc. Natl. Acad. Sci. USA 98:9191-9196) 
and a BamHI-NotI fragment of a 150-bp synthetic splice 

20 donor site (a gift from M. Ikawa) were then cloned into 
the Sall-NotI site of pBS, followed by the deletion of 
the BamHI site, to generate pCX-EGFP-SD. A blunted 
Sall-NotI fragment of pCX-EGFP-SD was inserted into the 
blunted EcoRI-BamHI site of pBS-IR/DR (R, L) to generate 

25 pTransCX-EGFP-SD. An EcoRI-NotI fragment of 
pTransCX-EGFPrNeo (Horie, K., A. Kuroiwa, M. Ikawa, M. 
Okabe, G. Kondoh, Y. Matsuda, and J. Takeda. 2001; Proc. 
Natl. Acad. Sci. USA 98:9191-9196) containing a PGK-neo 
cassette was cloned into pBS, after which the BamHI site 

30 was deleted to generate pBS-Neo. A Notl-Kpnl fragment of 
pBS-Neo and a KpnI-XhoI fragment of pTransCX-EGFP-SD were 
inserted into the Notl-Xhol site of pBS-Sptlc2, resulting 
in the targeting vector. 
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A vector for recombinase-mediated cassette 
exchange (RMCE) was constructed in the following manner. 
To remove the loxP site downstream of the PGK-neo cassette 
5 of pTransCX-EGFPrNeo, a BamHI-Afllll fragment and a 
Notl-Afllll fragment were religated, resulting in 
pTransCX-EGFP:Neo-3 ' lox. After blunt ending and NotI 
linker ligation of all cleavage ends, a Clal-Bglll 
fragment of pLlCMVEGFPlL was replaced with a Pvul fragment 
10 of pTransCX-EGFP: Neo-3 ' lox containing the SB transposon 
and a about 200-bp flanking sequence on both sides^ 
resulting in pLlTransCX-EGFPlL. 

TransSA^-geo and pTc3/SAp-geo were constructed by 
15 cloning a blunt-ended Xhol fragment of pROSAp-geo 
(Friedrich, G. and P. Soriano. 1991; Genes Dev. 5 : 1513'-1523) 
into the blunt-ended EcoRI-BamHI site of pTransCX-GFP 
(Horie^ K.^ A. Kuroiwa, M. Ikawa^ M. Okabe, G. Kondoh, 
Y. Matsuda^ and J. Takeda.2001; Proc. Natl. Acad. Sci. USA 
20 98:9191-9196) and the blunt-ended BspEI-NcoI site of 
pRP790 (Fischer, S.E- , H.G. van Luenen, andR.H. Plasterk. 
1999.; Mol. Gen. Genet . 262 : 268-274 .) , respectively - 

For the expression of a six-His-tagged 
25 123-amino-acid N-terminal fragment of the SB transposase 
(N123) , the N-terminal fragment was first PGR amplified 
from pCMV-SB (pSBlO) (19) with primers 

5'-CATGCCATGGGAAAATCAAAAGAAATC-3' (SEQ ID NO: 29) and 
5 • -CCGCTCGAGCAGTGGCTTCTTCCTTG-3 ' (SEQ ID NO: 30) and 
30 digested with Ncol, BsrGI, and Xhol . Then, the NcoI-BsrGI 
fragment and the BsrGI-XhoI fragment were cloned into 
Ncol- and Xhol-digested pET21d (Novagen) , resulting in 
pET-N123. 
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For the DNA probe of IR/DR-L for the 
electrophoretic mobility shift assay (EMSA) , the 
Hindi II-Kpnl fragment of pTransCX-EGFPiNeo was cloned 
5 into the Hindlll-Kpnl site of pBS, resulting in 
pBS~IR/DR-L. 

Methylation of plasmid DNAs was performed 
with SssI CpG methylase (NEB) according to the 
10 manufacturer's protocol^ followed by purification with 
a PGR purification kit (Qiagen) . Complete methylation was 
confirmed by resistance to digestion with 
methylation-sensitive enzymes . 

15 (Cell culture and gene targeting) 

Mouse ES cells were cultured in Dulbecco 
modified Eagle medium containing 20% fetal bovine serum^ 
nonessential amino acids, sodium pyruvate, and 1,000 U 
of leukemia-inhibitory factor/ml on mitomycin C-treated 

20 mouse embryonic fibroblasts. Mouse erythroleukemia (MEL) 
cell clones RL5 and RL6 (8) were cultured in Dulbecco 
modified Eagle medium 
containing 10% fetal bovine serum. 



25 Targeted integration of the SB transposon at 

the Sptlc2 locus was performed by means of 
insertional-type homologous recombination. Briefly, 25 
\iq of the targeting vector was linearized at the BamHI 
site located in the homologous region and introduced into 

30 l.Oxlo'' ES cells by electroporation (240 V, 500 \iF} with 
a Gene Pulser II (Bio-Rad) . Cells were selected for 7 days 
with 150 \ig of G418/ml, after which resistant clones were 
picked up, expanded, and analyzed 
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with Southern blotting. 

RMCE was performed as described previously 
(32) . Briefly, 25 \ig of methylated or unmethylated 
5 plasmids was introduced by electroporation (250 V, 1,070 
pF) into MEL cell clone RL5 or RL6 together with 20 pg 
of Cre expression vector pBS185 and 200 \xg of salmon sperm 
DNA. Cells were selected for 10 days, starting 4 days after 
electroporation, with 10 pM gancyclovir. Limiting 
10 dilution was performed for the isolation of independent 
clones, which were expanded and prescreened by PGR with 
the primers EGFP-IU (5'-CACCCTC GTGACCACCCTGACCTAC~3 ' ) 
(SEQ ID NO: 31) and EGFP-IL 

(5'-CTTGATGCCGTTCTTCTGCTTGTCG-3' ) (SEQ ID NO: 32) for the 
15 detection of enhanced green fluorescent protein (EGFP) 
and with the primers HYG-IU 

(5 ' -CGGGCGTATATGCTCCCCATTGGTCTTGAC-3 ' ) (SEQ ID NO: 33) 
and TK-IL (5 ' -TGGTGTAGATGTTCGCGATTGTCTCGGAAG-3 ' ) (SEQ ID 
NO: 34) for the detection of HYTK. The orientation of the 
20 exchanged cassette was determined with the primers M13F 
(5'-ACGACGTTGTAAAACGACGGCCAGT-3' ) (SEQ ID NO: 35) and 
RMCE-DLl (5'-GCATCGCCATGGGTCACGACGAGATCCTC-3' ) (SEQ ID 
NO: 36) for orientation D and with the primers M13F and 
RMCE-IL-1 ( 5 ' -AAGTGAGTTTAAATGTATTTGGCTAAGGTG-3 ' ) ( SEQ 
25 ID NO: 37) for orientation I. Cassette exchange and 
methylation statuses were further confirmed by Southern 
blot analysis. 

(Isolation of male germ cells) 
30 Testes of a mouse carrying donor transposons 

(Horie, K., A. Kuroiwa, M. Ikawa, M. Okabe, G. Kondoh, 
y. Matsuda, and J. Takeda. 2001; Proc. Natl. Acad. Sci . 
USA 98: 9191-9196) were decapsulated and chopped into 
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about l-inm2 fragments with a razor blade. Germ cells were 
released by repeated pipetting. After brief 
centrif ugation to remove seminiferous tubules, the 
supernatant was collected and. sedimented by further 
5 centrifugation at 1,500 x g. Then, DNA was extracted from 
the cell pellet and analyzed with Southern blotting. 

(Sourthern blotting analysis) 

Genomic DNA was digested with restriction 
10 enzymes, fractioned by 0,7 % agarose gel, and transferred 
to HyBond*N+ nylon membrane (Amersham) . 0.7 kb EcoRI 
fragment of EGEP containing EGFP (Okabe M.et al., FEES 
Lett. 407, 313-9 (1997) ) was used as a probe for detecting 
a transposon-specif ic band- Hybridization and washing 
15 were conduced according to a standard manual (J. Sambrook 
et al. supra) (1989)). In order to evaluate the 
transposon copy number of the transgenic mice, tail DNA 
bands were compared to the genomic DNA derived from the 
ES cell clone comprising the sole copy of the transposon. 

20 

(PGR analysis) 

For excision of the transposon, the following 
primer set was used for detection: TgTP-2L, 
5' -ACACAGGAAACAGCTATGACCATGATTACG-3' (SEQIDNO: 7) and 
25 TgTP-lU, 5 ' -GACCGCTTCCTCGTGCTTTACGGTATC-3 ' (SEQ ID NO: 
6) - Each primer used is present outside pTransCX-GFP : Neo 
IR/DR(R) and IR/DR(L). PGR was conducted using 
HotStarTaq system (Qiagen) using the following 
conditions : 

30 95 degrees Celsius, 15 minutes, 50 cycles of: 94 

degrees Celsius, one minute, 59 degrees Celsius, one 
minute, 72 degrees Celsius, one minute, followed by the 
final step: 72 degrees Celsius, one minute (one cycle) . 
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Genotyping of the transgenic mice was conducted 
using the following primer set: 

With respect to the GFP gene: EGFP-1U,5 * 
5 -CACGCTGGTGACCAGCCTGACCTA3' (SEQIDNO: 73) and EGFP-IL, 5' 
-CTTGATGCCGTTCTCTGCTTGTCG-3* (SEQ ID NO: 74) ; and for the 
SBtransgene: SB-2U, 5' -TCCTAGAGATGAACGTACTTTGGT-3' (SEQ 
ID NO: 75) and SB-IL, 5' -ATCCAGATAATTTTCCTTGCTCATG-3' 
(SEQ ID NO: 76) . 

10 

PGR conditions were the same as above, except for 
the annealing temperature of 55 degrees Celsius and the 
number of cycles to be 30. Resultant PGR product were 313 
bp for the GFP gene, and 4 66 bp for the SB transgene. 
15 Flanking sequence in the novel inserted or integrated 
sites in the transposon were PGR amplified as described 
previously (Ivicz, Z.et al.. Cell 91,501-10(1997)). PGR 
product was directly sequence using dye-terminator and 
ABI373A DNA sequencer (Applied Biosystems) . 

20 

(Determination of GFP expression) 
The tip of the tail of mice just after the birth was 
resected for observation using GFP • specified Filter 
(Olympus, Tokyo, Japan) and Olympus fluorescence inverted 
25 microscope. Mice having GFP expression in the entire tail 
was determined to be positive. 

(Example 1: Transient transposon excision 

assay) 

30 First, the present Example confirmed whether 

methylation of a sequence encoding a transposon enhances 
excision capability. 
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(Transient transposon excision assay) 
Transposon DNA (pTransCX-EGFP: neo, Horie et 
al., PNAS 2001, Proc. Natl. Acad. Sci. USA. ; 98 : 9191-6) 
has been methylated in advance with SssI CpG methylase 
5 (New England Biolabs;50mM NaCl, lOmMTris-HCl, pH7 . 9, lOmM 
MgCl2, ImM DTT, 160}aM SAM, 0 . 2U/pl SssI ) . Methylation has 
been verified by observing non digestion of the sample 
by methylation sensitive Notl. 

Next, Murine erythroleukemia cell MEL cell; 
J. Mol. Biol. 292: 779-785, 1999) was introduced with 
transposon DNA and Sleeping Beauty (SB) transposase 
(PCMV-SB, obtained from Dr. P.Hackett) . Total DNA was 
extracted from the cell usingDNeasy Tissue Ki (QIAGEN) , 
and PGR was conducted with the plasmid vector using primers 
(TgTP-lU, TgTP-2L (previously shown)) and 358 bp PGR 
product is detected, which is amplified when the excision 
reaction is caused. Detection was conducted by staining 
the agarose gel with ethidium bromide after the 
electrophoresis of the agarose gel. As a reference, size 
markers have also been electrophoresed. An example of the 
excision system is shown in Figure ID (a) . 

(Excision assay of transposon temporarily 
25 transfected) 

For excision assay of transposon temporarily 
transfected, 1.0 x 10^ cells of MEL cells were transfected 
using TransFast (Promega) with 1.0 jjl g of methylated or 
unmethylated pTransCX-EGFP:Neo together with 1.0 jjl g of 
30 pSBlO or pBS. For excision assay of a transposon which 
has transiently transfected, I.O^IO^ cells of MEL cell 
have been tranfected with methylated or unmethylated 
pTransGX-EGFP:Neo (1.0 pg) together with pg of pSBlO or 
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pBS. 

Fourty-eight hours after the transf ection, DNeasy 
kit (QIAGEN) was used to extract genomic DNA. Four 
5 hundred ng and 80 ng of Hindlll digested DNA were used 
to the amplified excised products and neo fragments using 
HotStarTaq systemn (QIAGEN) . neo fragment is present in 
pTransCX-EGFPiNeo vector, and was used as an internal 
standard for transfection efficiency- Primer TgTP-lU 

10 (5' -GACCGCTTCCTCGTGCTTTACGGTATC-3' (SEQ ID NO: 6)) and 
primer TgTP-2L (5' -ACACAGGAAACAGCT ATGACCATGATTACG-3 ' 
(SEQ ID NO: 38) ) was used to detect the excised products, 
and primer neo-Ul(5' -GGGTGGAGAGGCTATTCGGCTATGA-3' (SEQ 
ID NO: 39)) and primer 

15 neo-Ll (5' -TGGATACTTTCTCGGCAGGAGCAAG-3' (SEQ ID NO: 40)) 
were used to amplify neo fragments. The amplification of 
the resected products were conducted as follows: 95 
degrees Celsius for fifteen minutes, followed by 35 cycles 
of 94 degrees Celsius for one minute, 63 degrees Celsius 

20 for one minute, and 72 degrees Celsius for one minute, 
and finally the final extension at 72 degrees Celsius for 
seven minutes. The same conditions were used for 
amplification of neo fragment except for that annealing 
temperature is 60 degrees Celsius and the number of cycles 

25 was 19. 

The excised product and neo fragment, 
according to manufacturer's manual, LightCycler 
FastStart DNA Master hybridization probe kit (Roche 
30 Diagnostics) was used to conduct quantification from 40ng 
of DNA, LightCycler apparatus (Roche Diagnostics) for 
real-time PCR. Fluorescent labeled probes for the 
resected products are as follows: 
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5'-CGGCCGCTCTAGCGGTACCCTAC-FITC-3' (SEQ ID NO: 41) and 
5' -LCReci640-GTAGGGGATCGACCTCGAGGGG-3MSEQ ID NO: 42), 
and fluorescence labeled probes for fragments are 
5' -GCTGTGCTCGACGTTGTCACTGAAG-FITC-3' (SEQ ID NO: 43) 
5 and ' -LCRed640-GGGAAGGGACTGGCTGCTATTGGG-3' (SEQ ID NO: 
44) . PGR primers for the excised products are as follows: 
5' -GTTGGGTCGTTTGTTCGGAT-3' (SEQ ID NO: 45) and 
5' -CGCGCAATTAACCCTCACTA-3' (SEQ ID NO: 46) . PGR primers 
for neo fragment are as follows: 

10 5' -AATGAACTGCAGGACGAGGC-3' (SEQ ID NO: 47) and 
5' -ATGGATACTTTCTCGGCAGG-3' (SEQ ID NO: 48). Four atto 
gram to 2 . Opg, and 0 . 2pg to 2 . Ong of control plasmid having 
a target sequence were used as a standard for the excised 
product and neo fragment, respectively. The 

15 amplification conditions were as follows: 95 degrees 
Celsius for ten minutes, followed by forty-five cycles 
of 95 degrees Celsius for ten seconds, 55 degrees Celsius 
for 10 seconds and 72 degrees Celsius for ten seconds. 
The excised product and neo fragments were quantified 

20 based on the standard curve, and the amount of excised 
product was divided by that of neo fragment for 
normalization. 

(RESULTS) 

25 Figures lA to IC show the results relating to 

that frequent Transposition is related to CpG methylation. 



(A) The introduction of a single copy of SB 
transposon into Sptlc2 locus by insertion-type homologous 
30 recombination and the detection of transposon excision 
by PGR. White square: exon, solid triangle: loxP site, 
gray and white arrows : nest id PGR primers, less dense line : 
backbone sequence of the plasmid, CAP: CAP promoter. 
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IR/DR-R and IR/DR-L: right IR/DR and left IR/DR, 
respectively^ Xb: Xbal, B: BamHI, K: Kpnl . 

(B) Excision of SB transposon in Sptlc2 locus 
5 of an ES cell. ES 115 clone (shown in panel A) having SB 
transposon inserted therein by targeting Sptlc2 locus, 
was transfected with sSBlO expressing the wild type of 
SB transposase (transposase +) or pSBlO-DDE expressing 
inactive SB transposase with deletion of DDE box 

10 (transposase-) (Ivies, Z., P. B. Hackett, R. H, Plasterk, 
and Z. Izsvak. 1997;Cell 91:501-510) in a serial dilution. 
As shown in Panel A, the resultant was screened with nested 
PGR. Four independent transf ections for each of dilution 
factors of pSBlO were conducted, and eight independent 

15 PGR was with 1 jxg of genomic DNA per reaction to conduct 
screening of 8fxg of genomic DNA per transf ection. Typical 
PGR results with maximum amount of pSBlO (2[xg) are shown 
in the left panel- The right panel shows the results of 
the average number of positive PGR for each transfection 

20 and the RT-PGR analysis at the expression level. Actb: 
beta-actin. 

(G) Southern blot analysis of methylation state in 
the transposon sequences of male mouse germ line cell and 

25 ES cell. EGFP was used as a probe. Genomic DNA derived 
from the germ line cell has not been digested by Hpall. 
This shows that the site has mostly been methylated. On 
the other hand, the genomic DNA from ES115 clone has been 
substantially completely digested. The presence of minor 

30 band (shown as asterisk) shows that a small fraction of 
Hpall site has been methylated in an ES cell- Solid 
circle: Hpall (H) or Mspl (M) site; X: Xhol site; 0.5 kb 
bands derived from Hpall-Mspl fragment in KM, KXM and 
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KXH lanes are shown with arrow. • 

The results of the present example is shown 
in Figure lD(b) . As is seen in the photography a cell in 
5 which methylated transposon has been introduced with a 
transposition enzyme, showed more often excision 
reactions than a cell with unmethylated transposons. 
Abbreviation: CAG: CAG promoter, EGFP: green fluorescent 
protein, pA: poly addition signal, L: left IR/DR, R: right 
10 IR/DR, M: methylation, N: non-methylation, NO: negative 
control (MEL cell, genomic DNA) . 

Accordingly, it was clarified that 
15 methylation allows significant- effects in the transposon. 

(EXAMPLE 2: establishment of a cell having 
methylated or unmethylated transposon in the same genetic 
locus of a murine genome) 

20 

Next, the present Example verifies whether 
methylation at the cellular level has an effect on 
transposition activity. 

25 (Methods) 

Two loxP sequences in reversed direction were 
constructed in a plasmid vector with a transposon located 
therebetween, and SssI CpG methylase was sub j ected thereto 
for methylation of the sequence encoding a transposon. 

30 Next, the plasmid DNA and the Cre recombinant enzyme 
expression vector were introduced to a MEL cell (strains 
RL5, RL6, E.E. Bouhassira, J. Mol. Biol. 292:779-785,1999) 
with hygromycin resistant gene and herpes viral thymidine 
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kinase (HSVTK) as foreign genes (transgene) between the 
loxP sequence in reversed direction. The Cre 
recombinant enzyme allows recombination between the 
plasmid and the loxP on the genome, resulting in the 
5 establishment of a cell having a transposon in a specific 
site on the genome. This allows efficient excision 
reaction due methylation in the same sites on the genome. 
SB transposase was introduced in to an established cell 
for detecting the excision reaction by PGR as described 
10 in Example 1. Figure 2A shows the experimental flow chart. 
In Figure 2A, solid arrow shows the loxP site, and white 
and gray arrows show the PGR primers. GMV indicates GMV 
promoter. 

15 (Excision . assay of stably integrated 

transposon) 

Excision assay of stably integrated 
transposon. For the excision assay of the stably 
integrated transposon, 2.0 X 10^ targeted ES cell clones 
20 or 1.0 X 10^ MEL cell clones were transfected with 2.0, 
0.67, 0.22, 0.074, 0.025, and 0.0082 fig of pSBlO, and 
genomic DNAs were extracted with a DNeasy kit 4 8 h 
post-transf ection . 

25 One microgram of genomic DNA was analyzed by 

PGR with the primers LGB2XL2 

(5 ' -TTCCAAAAGAAGTAGAGTGGAGAAGCAGTG-3 ' ) (SEQ ID NO: 49) 
and PGK2 ( 5 ' -AGGCCACTTGTGTAGCGCCAAGT-3 ' ) (SEQ ID NO: 50) 
for the targeted ES cell clone and with the primers TgTP-lU 

30 and TgTP-2L for MEL cell clones. Nested PGR with 1 m 1 of 
the first PGR product as a template was performed with 
the primers LCB2XL1 

(5'-CCAACCAAATACATTTAACATATTCTAGGT-3' ) (SEQ ID NO: 51) 
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and PGK4 (5 ' -GCTGCTAAAGCGCATGCTCCAGACTG-3 ' ) (SEQ ID NO: 
52) for the targeted ES cell clone and with the primers 
TgTP-2U (5 ' -TCTATCGCCTTCTTGACGAGTTCTTCTGAG-3 ' ) (SEQ ID 
NO : 8 ) and 

5 TgTP-3L ( 5 ' -CAAGCGCGCAATTAACCCTCACTAAAGG-3 M (SEQIDNO: 
9) for MEL cell clones. PGR conditions were 95 °C for 15 
min, followed by 30 cycles of 94°C for 1 min, 60°C for 
1 min, and 72 °C for 1 min, followed by a final extension 
at 72°C for 7 

10 min. To quantitate the expression level of the SB 
transposase, total RNA was sing TRIzol (Invitrogen) in 
accordance with the manufacturer's protocol and treated 
with RQl RNase-Free DNase (Promega) . One microgram of RNA 
was reverse transcribed with Superscriptll (Invitrogen) 

15 by using random hexamer primer in a total volume of 20 
\il in accordance with the manufacturer' s 
protocol^ and 2 pi of the reaction product was analyzed 
by PGR. PGR primers were 

5 ' -AATAGAACTGTTTGGCCATAATGACCATCG-3 ' (SEQ ID NO: 53) and 

20 5'-ATCCACATAATTTTCCTTCCTCATG-3' (SEQIDNO: 54) for the 
amplification of the SB transposase gene and 
5'-CAGGGTGTGATGGTGGGAATGGGTCAGAAG-3' (SEQIDNO: 55) and 
5' -TACGTACATGGCTGGGGTGTTGAAGGTCTC-3 ' (SEQIDNO: 56) for 
the amplification of the beta-actin gene that was used 

25 as an internal control. PGR conditions were the same as 
those used for the amplification of the excision product 
except that the number of cycles was 30 for the SB 
transposase gene and 18 for the beta-actin gene. 

30 (Results) 

Figure 2B shows a Southern blotting which 
shows a result of showing that the recombination is 
correctly performed. This figure shows restriction map 
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of each strain on the right hand side. Clones 5M1, 5M2, 
and 5M3 are cell lines with methylated transposons from 
RL5 introduced thereinto, and clones 5N1, 5N2 and 5N3 are 
cell lines with unmethylated transposons from RL5 
5 introduced thereinto. Further, 6M1 refers to a cell line 
with methylated transposons from RL6 introduced thereinto, 
and 6N1 refers to a cell line with unmethylated transposons 
from RL6 introduced thereinto. The bands having expected 
size have been detected. Accordingly, recombination via 
10 Cre-loxP are shown to properly arise. As used herein, 
EGFP was used as a probe. 

Further, it was confirmed, that the methylation 
introduced and non-methylation have been maintained. The 
15 results are shown in Figures 2C-D. Experiments were 
conducted by means of Southern blotting using methylation 
sensitive restriction enzyme Hpall (New England Biolabs) . 

The left hand and right hand panels show the results 
20 of restriction enzyme map and Southern blot , analyses of 
directions A and B, respectively. In any event, it was 
observed that methylation introduced was maintained from 
the fact of band pattern, and no new methylation was 
introduced in the case of non-methylation. 

25 

Figure 2E depicts production of a cell having 
methylated SB transposon or unmethylated SB transposon 
at the desired genetic locus of the genome. It shows a 
fluorescent activated cell sorting (FACS) profile for EGFP 
30 expression in a target clone. Gray region: wild-type cell. 
Light line: target clone. As shown in Figure 2E, 
enhancement of efficiency of transposon methylation was 
demonstrated in the protein expression level. 
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Both in RL5 derived clone and RL6 derived 
clone transfected with the methylated transposons, Hpall 
and Sail sites were neither digested. This shows that the 
5 methylation state has been maintained. In contrast, both 
Hpall and Sail sites have been completely digested in all 
the clones transfected with the unmethylated transposons, 
which shows that the unmethylated state has been 
maintained. Concurrent with these results, the 

10 expression of EGFP reporter in the transposon vector was 
suppressed in a cell comprising the methylated transposon, 
but not in a cell comprising the unmethylated transposon 
(see Figure 2E) . Both methylated and unmethylated states 
in the transposon region were maintained for at least 10 

15 weeks after the RMCE reaction. 

As such, it was shown that the enhancement of 
efficiency of transposon by' the methylation was also 
demonstrated in the protein expression level, and thus 
20 was demonstrated to be stably introduced therein. 

(Example 3: Effects of DNA methylation in the same 
site in the murine genome on a transposon excision 
reaction) 

25 The cell line established in Example 2, was 

introduced with SB transposase (PCMV-SB, available from 
P. Hackett, U. of Minnesota), and genomic DNA was 
recovered 48 hours thereafter. The collection of the 
genome was conducted using a genome extraction kit (DNeasy 

30 Tissue Kit, QIAGEN) . Nested PGR was used for detecting 
the excision of a transposon. Ten rounds of PGR reactions 
were conducted per cell line. One ^ig of genomic DNA was 
used as a template for reaction in the 1st PGR. The primers 
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used are shown as follows: 

1-1) TgTP-lU (SEQ ID NO: 6) 

1- 2) TgTP-2L (SEQ ID NO: 7) 

2- 1) TgTP-2U (TCTATCGCCTTCTTGACGAGTTCTTCTGAG; SEQ 
5 ID NO: 8) (2nd PGR; nested PGR) 

2-2) TgTP-3L (CAAGCGCGCAATTAACCCTCACTAAAGG; SEQ ID 
NO: 9) (2nd PGR; nested PGR) 

The thus obtained . sample (1 fig) was used as 
10 a template for the nested PGR for the following experiments. 
The genomic DNA of each cell line was used as a negative 
control (NC) . In the transposons with methylation 
introduced therein, significant increase (at least 10 
folds or greater) was found in the frequency of the 
15 excision response. 

Figure 3A is an example of showing effects of 
DNA methylation on the reaction of transposon excision 
in the murine genome . 

20 

Figure 3B-C depicts the details of frequent 
excision of methylated SB transposon in the predetermined 
locus of the genome. A portion thereof overlaps with 
Figure 3A. One micro gram or 10 nano gram of clones shown 

25 in Figure 2 have been used as a template for nested PGR. 
For each clone, 10 rounds of PGR have been performed. (B) 
Direction D; (G) Direction I. NG : genomic DNA which has 
not been transfected as negative control, M: 100 bp ladder. 
Right panel shows methylation state of the transposon 

30 region and the parent clone. 

In order to determine the effects of GpG 
methylation on the excision of transposon, the present 
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inventors have transfected a clone with a SB transposase 
expression vector^ and conducted a PCR for detecting the 
excision forty-eight hours after the transfection (see 
Figures 4A and 4B) . In the clones comprising 

5 unmethylated transposon, when 1 |ag of genomic DNA is used 
as a template, the excision was detected one in ten rounds 
. of PCR (Clones 5U1 or 5U2) or undetected (Clone 5U3 or 
6U1) . This frequency was similar to that observed in the 
ES cell having unmethylated transposon in the Sptlc2 locus 

10 (see Figure IB) . On the other hand, in the clones 
comprising the methylated transposon, the excision was 
detected in all ten reactions (Clones 5M1, 5M2, 5M3 and 
6M1) . When the genomic DNA was diluted for the first PCR, 
10 ng of the genomic DNA derived from the clone comprising 

15 the methylated transposon still presented positive signal 
(Figure 3B, Clones 5M3 and 6M1) . When the amount of the 
SB transposase expression vector showed reduction by about 
10 % for three consecutive rounds, the methylated 
transposon showed high excision in a consistent manner 

20 (data not shown) . This suggests that the expression level 
of the SB transposase is a cause of the change in excision 
frequency. This result shows that the excision 
efficiency of transposon is at least 100 times higher in 
the methylated transposon than the conventional. This 

25 result is consistent with the increase of about 100 times 
in excision efficiency in a methylated transposon, which 
was used for transient transfection (Figure IB) . 

As described above, it was demonstrated that 
30 the methylation of a portion encoding a transposon 
promotes excision reaction of a transposon. 

(Example 4: Effects of methylation on the 
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insertion of the transposon into the genome) 

In the present Example, the inventors 
conducted detection of the integration of a plasmid DNA 
containing transposon into the genome. Promoter trap 
5 type transposon was constructed. The method for this 
construction is briefly described herein below, Xhol 
fragment from pROSAjS geo was replaced with the EcoRI-BamHI 
fragment from pTransCX-GFP . 

10 Trap-type transposon was inserted inside into 

the gene of the murine genome, and if the framework of 
the amino acids are consistent, then the p-geo (pROSAPgeo 
(Genes & Development 5:1513-1523,1991} is expressed and 
the cell will be resistant to the antibiotic G418 

15 (Geneticin, Invitrogen) . The structure of the plasmid 
herein used is shown in Figure 4A. Abbreviations in 
Figure 4A are: SA, splice acceptor, p-geo: fusion gene 
with p-gal and neomycin resistant gene. 

20 The promoter trap type transposon was 

methylated using SssI CpG methylase, and was integrated 
into a murine embryonic stem (ES) cell (RIES cell, A. Nagy) 
with an SB transposase enzyme. Thereafter selection was 
conducted using G418 for seven days. After fixation with 

25 methanol, Giemsa staining was conducted (Nakalai Tesque, 
Kumamoto, Japan) . 

(SB transposase- and Tc3-mediated 
integration of promoter trap transposon) 
30 For SB transposase-mediated integration of 

the promoter trap transposon, 2.0 X 10^ ES cells were 
transfected with 1.6 ^g of methylated or unmethylated 
pTransSAP-geo together with 0.4 pg of pSBlO by using 
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TransFast and plated into 1 well of a 24-well plate. The 
transfected cells were then transferred to a 10-cm dish 
48 h posttransf ection and selected for 7 days with 150 
\ig of G418/ml, after which the colonies were Giemsa stained 
5 and counted. Several G418-resistant clones were expanded, 
and their DNAs and RNAs were extracted. The flanking 
sequences of integration sites were determined by 
ligation-mediated PGR as described previously (Horie, K. , 
A. Kuroiwa, M. Ikawa, M. Okabe, G. Kondoh, Y. Matsuda, 
10 and J. Takeda. 2001; Proc. Natl. Acad. Sci. USA 
98 : 9191-9196) and were analyzed by using the Ensembl mouse 
genome database (version 16.30.1). Promoter trap events 
were verified by reverse 

transcription (RT)-PCR with the transposon-specif ic 
15 primer p-geo ( 5 ' -TGCCAGTTTGAGGGGACGACGACAGTATCG-3 ' ) 
(SEQ ID NO: 57) and the gene-specific primers 
5'-TGGAGTGAGCTAGAATCAGAAAGATGACAC-3' (SEQ ID NO: 58) for 
M2S^ 5' -GACTTTCAAGACCTTCGACGCACCGTTCAC-3 • (SEQ ID NO: 
5 9 ) for M2L, 5 ' -TCTTCAGCCACAGGCTCCCAGACATGACAG-3 ' ( SEQ 
20 ID NO: 60) for M4 ^ and 

5'-GATATGAAGAGCTGTCAGTTTGTAGCAGTC-3' (SEQ ID NO: 61) for 
Nl. PGR products were directly sequenced- For Tc3-mediated 
integration of the promoter trap transposon, 2.0 X 10^ 
ES cells were transfected with 1.0 pg of methylated or 

25 unmethylated pTc3/SAp-geo together with 1 . 0 ijg of pRP2302 
(Fischer, S.E., E.Wienholds, and R. H. Plasterk. 2001; 
Proc. Natl. Acad. Sci. USA 98:6759-67 64), an expression 
vector of Tc3 transposase, by using TransFast. 
Transfected cells were passaged and selected by G418 under 

30 the conditions described above. 

Figure 4B shows the results. As is seen from 
the number of blue spots on the disc, it was revealed that 
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the methylated transposon significantly increased the 
number of G418 resistant colonies than the unmethylated 
transposon. 

5 Figure 4B depicts" effects of CpG methylation 

on the SB transposition. Average values of G418 resistant 
colony per culture dish against three independent 
transf ections are shown in the right panel. Error bar 
shows standard deviation. As shown, CpG methylation 
10 significantly enhances introduction of a gene. 

Figure 4C depicts the structure of a gene 
trapped and transposon insertion site. The insertion 
site is shown with black arrows. Clones M1-M4 are derived 

15 from transfection with methylated transposon, and Clones 
N1-N6 are derived from non-methylated transposon. The 
number of chromosomes. Ensemble gene designator, and the 
gene nomenclature are also shown. Except Clone M2, in 
which two insertion site have been characterized (called 

20 M2S and M2L) , one insertion site has been identified for 
each clone. Correct splicing in some clones (M2S, M2L, 
M4 and Nl) have been confirmed with RT-PCR using primers 
to the upstream exon (white arrow) and transposon. Solid 
square: exon. Scale bar for 5kb is shown in the right hand. 

25 

Figure 4D shows effect of methylation on Tc 
transposition. Construct of trapping vector is shown in 
the upper panel. TIR: terminal inverted repeat. Three 
independent transf ections are shown in the lower panel 
30 with respect to the average number of G418 resistant colony 
per dish. 



As shown in Figure 4, the effects of 
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methylation on the complete transposition was 
investigated. As described above, the excision of 
transposons from the genomic locus (the first step of the 
transposition reaction) was enhanced by the methylation 
5 of CpG. The excised transposon fragment needs to be 
reintegrated into the genome to complete the transposition 
reaction. We therefore compared methylated and 
unmethylated transposons in terms of the number of 
transposons integrated into the genome. Since CpG 

10 methylation inhibits promoter activity, the expression 
cassette of selection markers within a transposon could 
not be utilized to detect transposition into the genome. 
Instead, we employed a gene trap scheme (Friedrich, G., 
and P. Soriano. 1991; Genes Dev. 5 : 1513-1523) by 

15 constructing a gene trap-type transposon vector, 
pTransSAp-geo, which contained a splice acceptor site 
upstream of p-geo (Fig. 4A) . When the transposon is 
integrated into an active gene, a chimeric fusion 
transcript consisting of an endogenous gene and |3-geo is 

20 thought to be generated. If p-geo is in frame with the 
trapped gene, cells become resistant to G418 due to the 
expression of p-geo (Fig. 4A) . The methylated or 
unmethylated transposon vector was introduced into ES 
cells together with the SB transposase expression vector 

25 and selected with G418 . The methylated transposon produced 
1, 600 colonies, while the unmethylated transposon yielded 
140 colonies (Fig. 4B) . This result indicates that the 
efficiency of overall transposition, which includes both 
excision and integration, is enhanced 11-fold in the 

30 methylated transposon- Transposon insertion sites were 
determined by ligation mediated PGR, and gene trap events 
were verified in randomly selected clones by RT-PCR 
analysis (Fig. 4C) , and the results obtained indicated 
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that vector methylation did not affect gene trap 
selection. 

To test the effect of methylation in a 
5 different transposon system, we constructed another gene 
trap-type vector, pTc3/SAp-geo, by using the Tc3 
transposon derived from C. elegans (Fischer, S. E., H. 
G. van Luenen, and R. H. Plasterk. 1999.; Mol . Gen. Genet. 
262: 268-274.; and Fischer, S. E.,E. Wienholds, and R. 

10 H. Plasterk. 2001; Proc. Natl. Acad. Sci. USA 
98:6759-6764) and introduced this vector with or without 
methylation into ES cells together with the Tc3 
transposase expression vector. CpG methylation increased 
gene trap events two fold (Fig. 4D) . This result indicates 

15 that the enhancement of transposition by CpG methylation 
is seen in other transposon systems, although the extent 
of enhancement varies. 

Accordingly, it was demonstrated that 
methylation significantly enhances the integration of 

20 transposon into the genome. 

(Example 5: Expression of Protein) 
(Protein Expression and EMSA) 
The N123 peptide comprising IR/DR recognition 
25 domain was expressed and purified as described previously 
(Ivies, Z., P.B. Hackett, R. H. Plasterk, and Z. Izsvak. 
1997; Cell 91: 501-510). The oligonucleotide sequence 
for the oligonucleotide probe corresponding to the outer 
binding sites of IR/DR-L were, for unmethylated probe: 
30 5' -TACAGTTGAAGTCGGAAGTTTACATACACTTAAG-3' (Unmet-U) (SEQ 
ID NO: 62) and 5' -CTTAAGTGTATGTAAACTTCCGACTTCAACTGTA-3' 
(Unmet-L) (SEQ ID NO: 63), and for methylated probes: 
5 ' -TACAGTTGAAGTOGGAAGTTTACATACACTTAAG-3 ' (Met-U ) ( SEQ 
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ID NO: 64) and 5' -CTTAAGTGTATGTA7^CTTCOGACTTCAACTGTA-3' 
(Met-L) (SEQ ID NO: 65) (as used herein the symbol "O" 
refers to 5-methyl cytosine) . This oligonucleotide was 
first labeled with [y-'^^PlATP using T4 polynucleotide 
5 kinase, and then annealed. DNA fragment of IR/DR-L was 
digested with Hindlll and Kpnl for the preparation, and 
thereafter, the fragment was methylated using SssI 
methylase or kept unmethylated and then the fragment was 
treated with an alkaline phosphatase. The digested 

10 fragment was extracted using phenol-chloroform, and 
precipitated with ethanol, and then finally, T4 
polynucleotide was used to label with [y-^^PlATP. In the 
presence of poly(dl-dC) (1.0]ag), 0.2 pmol of 
oligonucleotide probe or 100 pg of IR/DR-L fragment, and 

15 l.Opl of N123, a nucleoprotein complex was formed in a 
buffer described previously in Ivies, Z., P. B. Hackett, 
R- H. Plasterk, and Z. Izsvak. 1997; Cell 91:501-510). 
After a thirty minute incubation at 25 degrees Celsius, 
samples were loaded on a 5%-native polyacrylamide gel, 

20 and electrophoresed at 150V constant voltage, and in 
0 . 5xTris-borate-EDTA, 1.5 hours for oligonucleotide 
probe and 2.5 hours for IR/DR-L fragment. After the gel 
was dried, the nucleoprotein was visualized by 
autoradiography . 

25 

(ChIP Assay) 

According to the manufacturer's protocol. 
Chip assay kit (Upstate) was used for chromatin 
immunoprecipitation (ChIP) . The antibodies used in the 
30 present study were anti acetyl H3 (Upstate) and 
anti-trimethylated H3K9 (Abeam) . Precipitated DNA was 
used for analysis by PGR in a plurality of loci, for amylase 
2.1: primers 5' -CCTTGTACGGGTTGGTGGAGGTCAC-3' (SEQ ID NO: 
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66) and 5' -CGCCACTCGAACAGGTGGACAATAG-3' (SEQ ID NO: 67), 
for p-small globin gene, 

5' -TGCGAGGATAAGAACAGACACTAC-3' (SEQ ID NO: 68) and 
5'-ACAGACTCAGAAGCAAACGTAAGA-3' (SEQ ID NO: 69); for EGFP 
5 gene, EGFP-IU and EGFP-IL, for IR/DR-L, 

5' -GCACGGGTGTTGGGTCGTTTGTTC-3MSEQ ID NO: 70) and 
5' -CTTCTAAAGCCATGACATCATTTTCTG-3' (SEQ ID NO: 71), and 
for IR/DR-R, 5' -GAAGGCTACTCGAAATGTTTGACCCAAG-3' (SEQ ID 
NO: 72) and 5' -CAAGCGCGCAATTAACCCTCACTAAAGG-3' (SEQ ID 

10 NO: 9) are used. PGR conditions used are as follows: 95 
degrees Celsius for 15 minutes using HotStarTaq, followed 
by 35 cycles of 94 degrees Celsius for thirty seconds, 
60 degrees Celsius for thirty seconds, and 72 degrees 
Celsius for one minute, and finally the final extension 

15 at 72 degrees Celsius for seven minutes . In order to allow 
semi-quantitative analysis, 1 v^l, 0.2 ]al, 0.04 \il and 
0.008 111 of input DNA per reaction was used, and 1 \il and 
0.2 yl of DNA were precipitated per reaction. For EGFP, 
IR/DR-L and IR/DR-R in the fractions of anti-trymethylated 

20 H3K9, 5 111 and 1 pi of DNA were used per reaction. 

Figure 6 shows invariant affinity of SB 
transposon DNA binding domain against naked methylated 
IR/DR. 

25 

Figure 6A shows an exemplary drawing of CpG 
sites in the IR/DR and transposase binding sites. Solid 
circle: CpG sites. 

30 Figures 6B-E shows electrophoretic mobility 

shift assay (EMSA) in the recombinant SB transposase 
peptide (N123) . 
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Figures 6B and 6C shows results obtained in 
the external binding site of IR/DR-L 34 bp, and Figures 6D 
and 6E show the results obtained by IR/DR-L fragment of 
300 bp. 

5 

As shown in Figure 6B, a concentration of N123 
peptide sufficient for enhancing non-methylated or 
methylated external binding sites (1600 to 100 dilution 
of the purified peptide) mixed therewith to form a 
10 nucleoprotein complex. 

As shown in Figure 6C, non-methylated 
external binding site has been labeled for use as a probe, 
and non-methylated external binding site or methylated 
15 external binding site have been used as competitive 
substance in an increased concentration (10-50 folds molar 
excessive probes) . 

As shown in Figure 6D, non-methylated IR/DR-L 
20 fragment or methylated IR/DR-L fragment have been mixed 
with an increased concentration of N123 peptides (5100-160 
folds dilution of the purified peptides) for formation 
of nucleoprotein complex. 

25 As shown in Figure 6E, non-methylated IR/DR-L 

fragment was labeled for use as a probe, and non-methylated 
or methylated IR/DR-L fragments have been used as a 
competitive substance in an increased concentration 
(500-8000 folds molar excessive probes). Unmet: 

30 non-methylated. Met: methylated, F: free probe, C: complex. 
Complexes 1 (CI) and 2 (C2) indicate one and two 
molecule (s) of N123 peptides per IR/DR, respectively 
(Ivies, Z., P. B. Hackett, R. H. Plasterk, and Z . Izsvak. 
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1997; Cell 91: 501-510). 

Figure 7 shows ChIP assay of methylated 
transposon or non-methylated transposon in a 
5 predetermined locus . 

Figure 7A shows SB transposon region, which 
was analyzed in the ChIP assay. PCR amplification region 
is shown as light line under each component. Solid 
10 triangle: lox 511 site. 

Figures 7B and 7C show PCR analysis of 
precipitated DNA. As shown in Figures 3 and 4, Clones 5M3 
and 5U3 (B) and 6M11 and 6U1 (C) are derived from parent 

15 clone RL5 and RL6, respectively. Input and precipitated 
DNA have been analyzed in a five-fold stepwise dilution. 
AcH3: anti-acetylated H3, MeH3K9; anti-trymethylated 
H3K9, no Ab: control having no antibodies. Amylase 2.1 
and beta-small globin have been used as a typical control 

20 against heterochromous region and euchromatin region . 
Intensity of bands of amylase 2.1 and beta-small globin 
are the same in the anti-acetyl H3 fraction, whereas the 
higher amplification efficiency than amylase 2 . 1 sequence 
in the input DNA, indicates that beta-small globin is 

25 enriched compared with the amylase 2.1 sequence in this 
fraction. Minor bands found in the control lanes without 
antibodies are derived from non-specific binding of the 
genomic DNA against the protein. Agarose beads are used 
for preparation. Determination has conducted twice. 

30 Representative results are shown. 

As shown in Figures 6 and 7, unchanged 
affinity of the SB transposase DNA binding domain to naked 
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methylated IR/DR was demonstrated in the present 
invention. 

In order to clarify the mechanism of 
5 transposition which has been enhanced by methylation of 
CpG, the present inventors have compared the affinity of 
recombinant SB transposase peptide against the methylated 
IR/DR and unmethylated IR/DR by EMSA. Both right and left 
IR/DR have two binding sites against the SB transposase ~ 

10 (Figure 6A) . Since only outer binding site comprises CpG 
sequence (Figure 6A) , the binding site of the outer left 
was used as a probe and antagonist. The first 123 amino 
acids (N123) of SB transposase, which was previously 
reported to comprise IR/DR recognition domain (Ivies, Z. , 

15 P. B. Hackett, R. H. Plasterk, and Z. Izsvak. 1997; Cell 
91: 501-510), was used to express in Escherichia coli, 
to purify by means of histidine tag at the C-terminus, 
and to produce a nucleoprotein. If unmethylated or 
methylated binding sequence was used as a probe, the 

20 intensity of shifted bands are comparable (Figure 6B) . 
When unmethylated or methylated site was used as an 
antagonist against a labeled unmethylated probe, no 
difference was observed (Figure 6C) . The same experiment 
using the entire region of the left IR/DR fragment was 

25 used as a probe and an antagonist, no action of methylation 
was observed (Figures 6D and 6E) . These results indicate 
that the CpG methylation of IR/DR does not change the 
direct binding of N123 peptide against a naked IR/DR, and 
indicated that it may be necessary that native chromatin 

30 structure in the transposon region enhances the 
transposition . 

As shown in Figure 7, chromatin structure in 
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the methylated and non-methylated transposons was 
confirmed. 

DNA recombination reaction may be affected by 
5 the change in higher dimension of structure of the DNA 
substrate. For example, E.coli -protein HU stimulate 
recombinant efficiency in bacteriophage Mu by binding to 
transposase binding site. Similar action of high mobile 
group Bl (HMGBl) protein relating to SB transposition was 

10 recently reported. The present inventors have therefore, 
investigated chromatin structure in the methylated and 
unmethylated transposon in the MEL cell clone using ChIP 
assay. It was shown that the modification of the histone^ 
tail played an important role in regulating chromatin 

15 structural machinery and gene expression in a higher stage. 
Recent studies shows that acetylated histone H3 is located 
in euchromatin where DNA is roughly packaged and 
methylated histone H3 was located onto heterochromatin 
by nine lysine residues (H3K9) , where DNA was concentrated 

20 (Grewal, S. I., andS. C. Elgin. 2002; Curr. Opin. Genet. 
Dev. 12: 178-187). The inventors therefore used an 
antibody recognizing acetyl H3 or trimethyl H3K9 in the 
Chip assay to compare methylated transposon regions of 
two different loci and chromatin structures of 

25 unmethylated transposon using RL5 derived clone and RL6 
derived clone (Figure 7) . As an internal standard for the 
Chip assay, pancreas amylase 2.1 gene was used as a 
heterochrome marker, and beta-small globin gene was used 
as a euchromatic marker (Dhar, V., A. I. Skoultchi, and 

30 C.L.Schildkraut . 1989. ; Mol. Cell. Biol. 9: 3524-3532; and 
Schubeler, D., C. Francastel, D. M. Cimbora, A. Reik, D. 
I. Martin, andM. Groudine. 2000; Genes Dev. 14: 940-950) . 
As shown in Figure 7, in proportion of those enriched for 
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acetyl H3^ the amount of amylase 2.1 gene in the fraction 
enriched for trimethylated H3K9^ was significantly higher 
than that for beta-small globin, which concurs the 
expectation that amylase 2.1 gene and beta-small globine 
5 gene are located in heterochromatin and euchromatin, 
respectively. Enrichment of transposon sequence 

(IR/DR-R, EGFP and IR/DR-L) in the acetyl H3 fraction was 
observed in the clones comprising unmethylated 
transposons (5U3 and 6U1) . In contrast, enrichment in the 

10 trimethylated H3K9 fraction, the clones comprising 
methylated transposon (5M3 and 6M1) have been observed. 
The enrichment was closely correlated to the expression 
pattern of the GFP reporter (Figure 3D) . These results 
indicate that the methylated transposon region and 

15 unmethylated transposon region have involved in formation 
of heterochromatin and euchromatin, respectively. 

As such, the transposition efficiency of a 
transposon was significantly enhanced in the protein 
20 expression level. Further, the mechanism therefore was 
also elucidated. 

(Example 6: production of a transgenic mouse) 
A system according to Example 4 was used to produce 
25 a transgenic mouse. In brief, ES cells obtained in 
Example 4 was injected into the blastcyst and the same 
was returned back to the uterus of a pseudo pregnant mouse 
to produce a mouse. This mouse has also been demonstrated 
that the methylation against the insertion of the 
30 transposon into the genome has significantly enhances the 
efficiency. 



(Example 7: Examples using naturally 
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occurring embodiments ) 

Next, it was demonstrated that the 
methylation is useful for production of a transgenic 
biological organism. In brief, a mouse was produced by 
5 means of simultaneously injecting a methylated transposon 
DNA and a transposase RNA into a fertilized egg. This 
mouse also demonstrated that the methylation 
significantly enhances the efficiency by the insertion 
of a transposon into the genome. 

10 

(Example 8: Examples of different 

transposons ) 

Next, as one of Tcl/mariner, Tc (Accession No • 
X01005) was used in lieu of SB system used in Example 1, 
15 to conduct the same experiments as in Examples 1-3. This 
mouse also demonstrated that the methylation 
significantly enhances the efficiency by the insertion 
of a transposon into the genome. 

20 (Example 9: Example of a different 

transposon) 

Next, as one of minos-2 (Accession No . Z29098) 
was used in lieu of SB system used in Example 1, to conduct 
the same experiments as in Examples 1-3. This mouse also 
25 demonstrated that the methylation significantly enhances 
the efficiency by the insertion of a transposon into the 
genome . 

Although certain preferred embodiments have 
30 been described herein, it is not intended that such 
embodiments be construed as limitations on the scope of 
the invention except as set forth in the appended claims. 
Various other modifications and equivalents will be 
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apparent to and can be readily made by those skilled in 
the art, after reading the description herein, without 
departing from the scope and spirit of this invention. 
All patents, published patent applications and 
5 publications cited herein are incorporated by reference 
as if set forth fully herein. 

INDUSTRIAL APPLICABILITY 
The invention is used for allowing efficient 
10 production of a transgenic biological organism (in 
particular, vertebrates such as mammals) . Such a 
biological organism is extremely useful as a model animal, 
screening, pharmacological tests and the like. 



